Query-Policy Misalignment in Preference-Based Reinforcement Learning

Xiao Hu · Jianxiong Li · Xianyuan Zhan · Qing-Shan Jia · Ya-Qin Zhang

Abstract
