Skip to yearly menu bar Skip to main content


Spotlight

Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets

Han Zhong ⋅ Wei Xiong ⋅ Jiyuan Tan ⋅ Liwei Wang ⋅ Tong Zhang ⋅ Zhaoran Wang ⋅ Zhuoran Yang
2022 Spotlight

Abstract

Video

Chat is not available.