Skip to yearly menu bar Skip to main content


Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings

Ming Yin · Yu-Xiang Wang

Abstract

Chat is not available.