Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

Right in the Right Way: Combining Verifiable Rewards with Human Demonstrations

Mehul Damani ⋅ Isha Puri ⋅ Idan Shenfeld ⋅ Jacob Andreas

Abstract

Log in and register to view live content