Skip to yearly menu bar Skip to main content


Bayesian Reward Models for LLM Alignment

Adam Yang ⋅ Maxime Robeyns ⋅ Thomas Coste ⋅ zhengxiang shi ⋅ Jun Wang ⋅ Haitham Bou Ammar ⋅ Laurence Aitchison

Abstract

Video

Chat is not available.