firstbacksecondback
36 Results
Spotlight
|
Thu 8:55 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Poster
|
Thu 15:00 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Thu 7:50 |
Continuous Control with Action Quantization from Demonstrations Robert Dadashi · Léonard Hussenot · Damien Vincent · Sertan Girgin · Anton Raichuk · Matthieu Geist · Olivier Pietquin |
|
Poster
|
Thu 15:00 |
Continuous Control with Action Quantization from Demonstrations Robert Dadashi · Léonard Hussenot · Damien Vincent · Sertan Girgin · Anton Raichuk · Matthieu Geist · Olivier Pietquin |
|
Spotlight
|
Tue 11:00 |
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels Edoardo Cetin · Philip Ball · Stephen Roberts · Oya Celiktutan |
|
Poster
|
Tue 15:30 |
Stabilizing Off-Policy Deep Reinforcement Learning from Pixels Edoardo Cetin · Philip Ball · Stephen Roberts · Oya Celiktutan |
|
Spotlight
|
Wed 13:40 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Poster
|
Wed 15:30 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Spotlight
|
Thu 11:55 |
Constrained Variational Policy Optimization for Safe Reinforcement Learning Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao |
|
Poster
|
Thu 15:00 |
Constrained Variational Policy Optimization for Safe Reinforcement Learning Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao |
|
Spotlight
|
Thu 13:00 |
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning Adam Villaflor · Zhe Huang · Swapnil Pande · John Dolan · Jeff Schneider |
|
Poster
|
Thu 15:00 |
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning Adam Villaflor · Zhe Huang · Swapnil Pande · John Dolan · Jeff Schneider |