

Poster

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

Xiaoyu Chen ⋅ Han Zhong ⋅ Zhuoran Yang ⋅ Zhaoran Wang ⋅ Liwei Wang
2022 Poster

Abstract
