

Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Kefan Dong ⋅ Jiaqi Yang ⋅ Tengyu Ma

Abstract
