Skip to yearly menu bar Skip to main content


Query-Policy Misalignment in Preference-Based Reinforcement Learning

Xiao Hu ⋅ Jianxiong Li ⋅ Xianyuan Zhan ⋅ Qing-Shan Jia ⋅ Ya-Qin Zhang

Abstract

Video

Chat is not available.