Skip to yearly menu bar Skip to main content


Poster
in
Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)
Sat, Jul 19, 2025 • 3:00 PM – 3:45 PM PDT

A Representation Engineering Perspective on the Effectiveness of Multi-Turn Jailbreaks

Blake Bullwinkel · Mark Russinovich · Ahmed Salem · Santiago Zanella-Beguelin · Dan Jones · Giorgio Severi · Eugenia Kim · Keegan Hines · Amanda Minnich · Yonatan Zunger · Ram Shankar Siva Kumar

Abstract

Chat is not available.