Skip to yearly menu bar Skip to main content


Poster

Data- and Variance-dependent Regret Bounds for Online Tabular MDPs

Mingyi Li ⋅ Taira Tsuchiya ⋅ Kenji Yamanishi

Abstract

Log in and register to view live content