Skip to yearly menu bar Skip to main content


Provable Offline Reinforcement Learning with Human Feedback

Wenhao Zhan ⋅ Masatoshi Uehara ⋅ Nathan Kallus ⋅ Jason Lee ⋅ Wen Sun

Abstract

Video

Chat is not available.