firstbacksecondback
91 Results
Spotlight
|
Tue 11:55 |
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning Shentao Yang · Yihao Feng · Shujian Zhang · Mingyuan Zhou |
|
Poster
|
Wed 15:30 |
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer Xingyu Liu · Deepak Pathak · Kris Kitani |
|
Oral
|
Tue 11:15 |
Offline RL Policies Should Be Trained to be Adaptive Dibya Ghosh · Anurag Ajay · Pulkit Agrawal · Sergey Levine |
|
Spotlight
|
Thu 11:35 |
Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang |
|
Poster
|
Thu 15:00 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Thu 13:45 |
Robust Policy Learning over Multiple Uncertainty Sets Annie Xie · Shagun Sodhani · Chelsea Finn · Joelle Pineau · Amy Zhang |
|
Poster
|
Thu 15:00 |
A Parametric Class of Approximate Gradient Updates for Policy Optimization Ramki Gummadi · Saurabh Kumar · Junfeng Wen · Dale Schuurmans |
|
Poster
|
Thu 15:00 |
Model Selection in Batch Policy Optimization Jonathan Lee · George Tucker · Ofir Nachum · Bo Dai |
|
Poster
|
Thu 15:00 |
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization Minghuan Liu · Zhengbang Zhu · Yuzheng Zhuang · Weinan Zhang · Jianye Hao · Yong Yu · Jun Wang |
|
Poster
|
Thu 15:00 |
Safe Exploration for Efficient Policy Evaluation and Comparison Runzhe Wan · Branislav Kveton · Rui Song |
|
Spotlight
|
Tue 11:00 |
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels Edoardo Cetin · Philip Ball · Stephen Roberts · Oya Celiktutan |
|
Spotlight
|
Thu 12:50 |
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces Amrit Singh Bedi · Souradip Chakraborty · Anjaly Parayil · Brian Sadler · Pratap Tokekar · Alec Koppel |