What is your p(trigger)?
Nicklas, I’m about to start a PhD at CHIA in this realm. I’m considering how to build (multi)national security evals for frontier models. Are there particular questions you would advise asking a model when building benchmarks along the way?