Poster in Workshop: Next Generation of AI Safety

BELLS: A Framework Towards Future Proof Benchmarks for the Evaluation of LLM Safeguards

Diego Dorn ⋅ Alexandre Variengien ⋅ Charbel-Raphaël Segerie ⋅ Vincent Corruble

Abstract
