Poster in Workshop: RLxF: RL from World Feedback

Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models

Samy Jelassi ⋅ Mujin Kwun ⋅ Rosie Zhao ⋅ Yuanzhi Li ⋅ Nicolò Fusi ⋅ Yilun Du ⋅ Sham Kakade ⋅ Carles Domingo i Enrich

Abstract
