Poster in Workshop: Next Generation of AI Safety

Can Editing LLMs Inject Harm?

Canyu Chen ⋅ Baixiang Huang ⋅ Zekun Li ⋅ Zhaorun Chen ⋅ Shiyang Lai ⋅ Xiongxiao Xu ⋅ Jia-Chen Gu ⋅ Jindong Gu ⋅ Huaxiu Yao ⋅ Chaowei Xiao ⋅ Xifeng Yan ⋅ William Wang ⋅ Phil Torr ⋅ Dawn Song ⋅ Kai Shu

Abstract
