Poster in Workshop: RLxF: RL from World Feedback

When Is World Feedback Transferable? A Convergence Gate in Contrastive Reinforcement Learning

Bruce C Xu ⋅ Jay J Park ⋅ Vivek Buch

Abstract
