Skip to yearly menu bar Skip to main content


Poster

Dense Reward for Free in Reinforcement Learning from Human Feedback

Alexander Chan · Hao Sun · Samuel Holt · M van der Schaar
2024 Poster

Abstract

Chat is not available.