Skip to yearly menu bar Skip to main content


Provable Model-based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Kefan Dong · Jiaqi Yang · Tengyu Ma

Abstract

Chat is not available.