Skip to yearly menu bar Skip to main content


Poster

Human Alignment of Large Language Models through Online Preference Optimisation

Daniele Calandriello · Zhaohan Guo · REMI MUNOS · Mark Rowland · Yunhao Tang · Bernardo Avila Pires · Pierre Richemond · Charline Le Lan · Michal Valko · Tianqi Liu · Rishabh Joshi · Zeyu Zheng · Bilal Piot
2024 Poster

Abstract

Chat is not available.