Skip to yearly menu bar Skip to main content


Invited Talk
in
Workshop: Humans, Algorithmic Decision-Making and Society: Modeling Interactions and Impact

Update from UK Gov's AI Safety Institute - Evals & Advancing AI Governance

Cozmin Ududec

[ ]
Sat 27 Jul 1:10 a.m. PDT — 1:30 a.m. PDT

Abstract:

As AI becomes more capable, it's crucial that governments have the capabilities to empirically understand and respond to risks. We've been building a research startup within the government to achieve that. Initially, our team of ML researchers focused on building LLM evaluations for societal impacts, dangerous capabilities, the effectiveness of safeguards, and agentic capabilities. We're now broadening our work, e.g. to study how we could predict specific capabilities and by launching a systemic safety grants program. In this talk, we'll provide an update on our technical and governance work.

Chat is not available.