

Poster Wed, Jul 16, 2025 • 11:00 AM – 1:30 PM PDT

Weak-to-Strong Jailbreaking on Large Language Models

Xuandong Zhao · Xianjun Yang · Tianyu Pang · Chao Du · Lei Li · Yu-Xiang Wang · William Wang

