Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

Preference Alignment Improves Information Conveyance in Language Models

Yuwei Cheng ⋅ Weiyi Tian ⋅ Haifeng Xu

Abstract

Log in and register to view live content