Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

Real-world Reinforcement Learning from Suboptimal Interventions

yinuo zhao ⋅ Huiqian Jin ⋅ Lechun Jiang ⋅ Xinyi Zhang ⋅ Kun Wu ⋅ Pei Ren ⋅ Zhiyuan Xu ⋅ Zhengping Che ⋅ Junjie Ji ⋅ Lei Sun ⋅ Dapeng Wu ⋅ Chi Harold Liu ⋅ Jian Tang

Abstract

Log in and register to view live content