Skip to yearly menu bar Skip to main content


Poster

Dense Reward for Free in Reinforcement Learning from Human Feedback

Alexander Chan ⋅ Hao Sun ⋅ Samuel Holt ⋅ M van der Schaar
2024 Poster

Abstract

Chat is not available.