Poster

Human Alignment of Large Language Models through Online Preference Optimisation

Daniele Calandriello ⋅ Zhaohan Guo ⋅ Remi Munos ⋅ Mark Rowland ⋅ Yunhao Tang ⋅ Bernardo Avila Pires ⋅ Pierre Richemond ⋅ Charline Le Lan ⋅ Michal Valko ⋅ Tianqi Liu ⋅ Rishabh Joshi ⋅ Zeyu Zheng ⋅ Bilal Piot
2024 Poster

Abstract
