Skip to yearly menu bar Skip to main content


Poster

MAGIC: A Co-Evolving Attacker–Defender Adversarial Game for Robust LLM Safety

Xiaoyu Wen ⋅ Zhida He ⋅ Han Qi ⋅ Ziyu Wan ⋅ Zhongtian Ma ⋅ Ying Wen ⋅ Tianhang Zheng ⋅ Xingcheng Xu ⋅ Chaochao Lu ⋅ Qiaosheng Zhang

Abstract

Log in and register to view live content