10   Show all »
Toggle Poster Visibility
Oral
Tue Jun 11th 02:00 -- 02:20 PM @ Room 104
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche · Paul TRICHELAIR · Remi Tachet des Combes
Oral
Tue Jun 11th 02:20 -- 02:25 PM @ Room 104
Distributional Reinforcement Learning for Efficient Exploration
Borislav Mavrin · Hengshuai Yao · Linglong Kong · Kaiwen Wu · Yaoliang Yu
Oral
Tue Jun 11th 02:25 -- 02:30 PM @ Room 104
Optimistic Policy Optimization via Multiple Importance Sampling
Matteo Papini · Alberto Maria Metelli · Lorenzo Lupo · Marcello Restelli
Oral
Tue Jun 11th 02:30 -- 02:35 PM @ Room 104
Neural Logic Reinforcement Learning
zhengyao jiang · Shan Luo
Oral
Tue Jun 11th 02:35 -- 02:40 PM @ Room 104
Learning to Collaborate in Markov Decision Processes
Goran Radanovic · Rati Devidze · David Parkes · Adish Singla
Oral
Tue Jun 11th 02:40 -- 03:00 PM @ Room 104
Predictor-Corrector Policy Optimization
Ching-An Cheng · Xinyan Yan · Nathan Ratliff · Byron Boots
Oral
Tue Jun 11th 03:00 -- 03:05 PM @ Room 104
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
Kelvin Xu · Ellis Ratner · EECS Anca Dragan · Sergey Levine · Chelsea Finn
Oral
Tue Jun 11th 03:05 -- 03:10 PM @ Room 104
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada · Saurabh Kumar · Jacob Buckman · Ofir Nachum · Marc Bellemare
Oral
Tue Jun 11th 03:10 -- 03:15 PM @ Room 104
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah Hanna · Scott Niekum · Peter Stone
Oral
Tue Jun 11th 03:15 -- 03:20 PM @ Room 104
Learning from a Learner
alexis jacq · Matthieu Geist · Ana Paiva · Olivier Pietquin