Skip to yearly menu bar Skip to main content


Poster

The Cylindrical Representation Hypothesis for Language Model Steering

Lang Gao ⋅ Jinghui Zhang ⋅ Wei Liu ⋅ Fengxian Ji ⋅ Chenxi Wang ⋅ Zirui Song ⋅ Akash Ghosh ⋅ Youssef Mohamed ⋅ Preslav Nakov ⋅ Xiuying Chen

Abstract

Log in and register to view live content