Poster

Preference Goal Tuning: Post-Training as Latent Control for Frozen Policies

Guangyu Zhao ⋅ Kewei Lian ⋅ Haoxuan Ru ⋅ Borong Zhang ⋅ Haowei Lin ⋅ Zhancun Mu ⋅ Haobo Fu ⋅ Qiang Fu ⋅ Shaofei Cai ⋅ Zihao Wang ⋅ Yitao Liang

Abstract
