

Aligning Large Language Models with Representation Editing: A Control Perspective

Lingkai Kong · Haorui Wang · Wenhao Mu · Yuanqi Du · Yuchen Zhuang · Yifei Zhou · Yue Song · Rongzhi Zhang · Kai Wang · Chao Zhang

Abstract

