firstbacksecondback
72 Results
Spotlight
|
Thu 12:45 |
Safe Exploration for Efficient Policy Evaluation and Comparison Runzhe Wan · Branislav Kveton · Rui Song |
|
Spotlight
|
Thu 8:55 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Thu 11:35 |
Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang |
|
Spotlight
|
Thu 8:50 |
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations Haoran Xu · Xianyuan Zhan · Honglei Yin · Huiling qin |
|
Spotlight
|
Wed 14:00 |
Constrained Offline Policy Optimization Nicholas Polosky · Bruno C. da Silva · Madalina Fiterau · Jithin Jagannath |
|
Poster
|
Thu 15:00 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Poster
|
Thu 15:00 |
Safe Exploration for Efficient Policy Evaluation and Comparison Runzhe Wan · Branislav Kveton · Rui Song |
|
Poster
|
Thu 15:00 |
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations Haoran Xu · Xianyuan Zhan · Honglei Yin · Huiling qin |
|
Poster
|
Thu 15:00 |
Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang |
|
Spotlight
|
Thu 11:00 |
Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching Jason Yecheng Ma · Andrew Shen · Dinesh Jayaraman · Osbert Bastani |
|
Poster
|
Wed 15:30 |
Constrained Offline Policy Optimization Nicholas Polosky · Bruno C. da Silva · Madalina Fiterau · Jithin Jagannath |
|
Spotlight
|
Tue 11:45 |
Supervised Off-Policy Ranking Yue Jin · Yue Zhang · Tao Qin · Xudong Zhang · Jian Yuan · Houqiang Li · Tie-Yan Liu |