Skip to yearly menu bar Skip to main content


Poster

Surgery: Mitigating Harmful Fine-Tuning for Large Language Models via Attention Sink

Guozhi Liu ⋅ Weiwei Lin ⋅ Tiansheng Huang ⋅ Ruichao Mo ⋅ Qi Mu ⋅ Xiumin Wang ⋅ Li Shen

Abstract

Log in and register to view live content