Skip to yearly menu bar Skip to main content


Oral

On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game

Shuang Qiu · Jieping Ye · Zhaoran Wang · Zhuoran Yang
2021 Oral

Abstract

Video

Chat is not available.