Skip to yearly menu bar Skip to main content


(12 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Jul 21 10:30 AM -- 10:50 AM (PDT) @ Room 301 - 303
Learning Bellman Complete Representations for Offline Policy Evaluation
Jonathan Chang · Kaiwen Wang · Nathan Kallus · Wen Sun
Spotlight
Thu Jul 21 10:50 AM -- 10:55 AM (PDT) @ Room 301 - 303
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Nathan Kallus · Xiaojie Mao · Kaiwen Wang · Zhengyuan Zhou
Spotlight
Thu Jul 21 10:55 AM -- 11:00 AM (PDT) @ Room 301 - 303
A Simple Reward-free Approach to Constrained Reinforcement Learning
Sobhan Miryoosefi · Chi Jin
Spotlight
Thu Jul 21 11:00 AM -- 11:05 AM (PDT) @ Room 301 - 303
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching
Jason Yecheng Ma · Andrew Shen · Dinesh Jayaraman · Osbert Bastani
Spotlight
Thu Jul 21 11:05 AM -- 11:10 AM (PDT) @ Room 301 - 303
Temporal Difference Learning for Model Predictive Control
Nicklas Hansen · Hao Su · Xiaolong Wang
Spotlight
Thu Jul 21 11:10 AM -- 11:15 AM (PDT) @ Room 301 - 303
Model Selection in Batch Policy Optimization
Jonathan Lee · George Tucker · Ofir Nachum · Bo Dai
Oral
Thu Jul 21 11:15 AM -- 11:35 AM (PDT) @ Room 301 - 303 None
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng · Tengyang Xie · Nan Jiang · Alekh Agarwal
Spotlight
Thu Jul 21 11:35 AM -- 11:40 AM (PDT) @ Room 301 - 303
Optimal Estimation of Policy Gradient via Double Fitted Iteration
Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang
Spotlight
Thu Jul 21 11:40 AM -- 11:45 AM (PDT) @ Room 301 - 303
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes
Hongyi Guo · Qi Cai · Yufeng Zhang · Zhuoran Yang · Zhaoran Wang
Spotlight
Thu Jul 21 11:45 AM -- 11:50 AM (PDT) @ Room 301 - 303
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
Spotlight
Thu Jul 21 11:50 AM -- 11:55 AM (PDT) @ Room 301 - 303
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
Huang Bojun
Spotlight
Thu Jul 21 11:55 AM -- 12:00 PM (PDT) @ Room 301 - 303
On the Role of Discount Factor in Offline Reinforcement Learning
Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang