Provable Offline Reinforcement Learning with Human Feedback

Wenhao Zhan · Masatoshi Uehara · Nathan Kallus · Jason Lee · Wen Sun
