Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Patrick Chao · Edoardo Debenedetti · Alex Robey · Maksym Andriushchenko · Francesco Croce · Vikash Sehwag · Edgar Dobriban · Nicolas Flammarion · George J. Pappas · Florian Tramer · Hamed Hassani · Eric Wong

Abstract

Chat is not available.