Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

Learning Diffusion Planners from World Feedback: A No-Go Result on Bit-Exact Safety Rewards and an ODD-Adaptive Shared/Expert Decomposition

Yun Li ⋅ Ehsan Javanmardi ⋅ Yidu Zhang ⋅ Simon Thompson ⋅ Qunli Zhang ⋅ Zifan Zeng ⋅ Shiming Liu ⋅ Peng Wang ⋅ Zixuan Guo ⋅ Manabu Tsukada

Abstract

Log in and register to view live content