Skip to yearly menu bar Skip to main content


Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation

Jiafan He ⋅ Dongruo Zhou ⋅ Quanquan Gu

Abstract

Chat is not available.