Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Patrick Chao ⋅ Edoardo Debenedetti ⋅ Alex Robey ⋅ Maksym Andriushchenko ⋅ Francesco Croce ⋅ Vikash Sehwag ⋅ Edgar Dobriban ⋅ Nicolas Flammarion ⋅ George J. Pappas ⋅ Florian Tramer ⋅ Hamed Hassani ⋅ Eric Wong

Abstract

Chat is not available.