Poster

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization

Huilin Zhou ⋅ Jian Zhao ⋅ Yilu Zhong ⋅ Zhen Liang ⋅ Xiuyuan Chen ⋅ Yuchen Yuan ⋅ Tianle Zhang ⋅ Chi Zhang ⋅ Lan Zhang ⋅ Xuelong Li

Abstract
