Skip to yearly menu bar Skip to main content


Nearly Optimal Regret for Learning Adversarial MDPs with Linear Function Approximation

Jiafan He · Dongruo Zhou · Quanquan Gu

Abstract

Chat is not available.