Skip to yearly menu bar Skip to main content


Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

Jiafan He · Dongruo Zhou · Quanquan Gu

Abstract

Chat is not available.