Skip to yearly menu bar Skip to main content


Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

Jiafan He ⋅ Dongruo Zhou ⋅ Quanquan Gu

Abstract

Chat is not available.