Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks

Maksym Andriushchenko · Francesco Croce · Nicolas Flammarion

Abstract

Video

Chat is not available.