Timezone: »
Poster
Deeply-Debiased Off-Policy Interval Estimation
Chengchun Shi · Runzhe Wan · Victor Chernozhukov · Rui Song
Off-policy evaluation learns a target policy's value with a historical dataset generated by a different behavior policy. In addition to a point estimate, many applications would benefit significantly from having a confidence interval (CI) that quantifies the uncertainty of the point estimate. In this paper, we propose a novel procedure to construct an efficient, robust, and flexible CI on a target policy's value. Our method is justified by theoretical results and numerical experiments. A Python implementation of the proposed procedure is available at https://github.com/ RunzheStat/D2OPE.
Author Information
Chengchun Shi (London School of Economics and Political Science)
Runzhe Wan (North Carolina State University)
Victor Chernozhukov (MIT)
Rui Song (North Carolina State University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Oral: Deeply-Debiased Off-Policy Interval Estimation »
Tue. Jul 20th 12:00 -- 12:20 PM Room
More from the Same Authors
-
2022 : Adversarial Estimation of Riesz Representers »
Victor Chernozhukov · Whitney Newey · Rahul Singh · Vasilis Syrgkanis -
2023 Poster: Multiplier Bootstrap-based Exploration »
Runzhe Wan · Haoyu Wei · Branislav Kveton · Rui Song -
2023 Poster: An Instrumental Variable Approach to Confounded Off-Policy Evaluation »
Yang Xu · Jin Zhu · Chengchun Shi · Shikai Luo · Rui Song -
2023 Poster: A Robust Test for the Stationarity Assumption in Sequential Decision Making »
Jitao Wang · Chengchun Shi · Zhenke Wu -
2023 Poster: A Reinforcement Learning Framework for Dynamic Mediation Analysis »
Lin Ge · Jitao Wang · Chengchun Shi · Zhenke Wu · Rui Song -
2022 Poster: Safe Exploration for Efficient Policy Evaluation and Comparison »
Runzhe Wan · Branislav Kveton · Rui Song -
2022 Spotlight: Safe Exploration for Efficient Policy Evaluation and Comparison »
Runzhe Wan · Branislav Kveton · Rui Song -
2022 Poster: A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes »
Chengchun Shi · Masatoshi Uehara · Jiawei Huang · Nan Jiang -
2022 Oral: A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes »
Chengchun Shi · Masatoshi Uehara · Jiawei Huang · Nan Jiang -
2020 Poster: Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making »
Chengchun Shi · Runzhe Wan · Rui Song · Wenbin Lu · Ling Leng -
2020 Poster: Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health »
Liangyu Zhu · Wenbin Lu · Rui Song -
2020 Poster: On Validation and Planning of An Optimal Decision Rule with Application in Healthcare Studies »
Hengrui Cai · Wenbin Lu · Rui Song