Skip to yearly menu bar Skip to main content


Oral

Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds

Andrea Zanette · Emma Brunskill
2019 Oral
[ Slides [ Video

Abstract

Chat is not available.