Oral

A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes

Chengchun Shi ⋅ Masatoshi Uehara ⋅ Jiawei Huang ⋅ Nan Jiang
2022 Oral
