Skip to yearly menu bar Skip to main content


Spotlight

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

Xiaoyu Chen ⋅ Han Zhong ⋅ Zhuoran Yang ⋅ Zhaoran Wang ⋅ Liwei Wang
2022 Spotlight

Abstract

Video

Chat is not available.