Skip to yearly menu bar Skip to main content


(10 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Tue Jun 11 02:00 PM -- 02:20 PM (PDT) @ Room 104
Safe Policy Improvement with Baseline Bootstrapping
Romain Laroche · Paul TRICHELAIR · Remi Tachet des Combes
[ Slides [ Video
Oral
Tue Jun 11 02:20 PM -- 02:25 PM (PDT) @ Room 104
Distributional Reinforcement Learning for Efficient Exploration
Borislav Mavrin · Hengshuai Yao · Linglong Kong · Kaiwen Wu · Yaoliang Yu
[ Slides [ Video
Oral
Tue Jun 11 02:25 PM -- 02:30 PM (PDT) @ Room 104
Optimistic Policy Optimization via Multiple Importance Sampling
Matteo Papini · Alberto Maria Metelli · Lorenzo Lupo · Marcello Restelli
[ Slides [ Video
Oral
Tue Jun 11 02:30 PM -- 02:35 PM (PDT) @ Room 104
Neural Logic Reinforcement Learning
zhengyao jiang · Shan Luo
[ Slides [ Video
Oral
Tue Jun 11 02:35 PM -- 02:40 PM (PDT) @ Room 104
Learning to Collaborate in Markov Decision Processes
Goran Radanovic · Rati Devidze · David Parkes · Adish Singla
[ Slides [ Video
Oral
Tue Jun 11 02:40 PM -- 03:00 PM (PDT) @ Room 104
Predictor-Corrector Policy Optimization
Ching-An Cheng · Xinyan Yan · Nathan Ratliff · Byron Boots
[ Slides [ Video
Oral
Tue Jun 11 03:00 PM -- 03:05 PM (PDT) @ Room 104
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning
Kelvin Xu · Ellis Ratner · Anca Dragan · Sergey Levine · Chelsea Finn
[ Slides [ Video
Oral
Tue Jun 11 03:05 PM -- 03:10 PM (PDT) @ Room 104
DeepMDP: Learning Continuous Latent Space Models for Representation Learning
Carles Gelada · Saurabh Kumar · Jacob Buckman · Ofir Nachum · Marc Bellemare
[ Slides [ Video
Oral
Tue Jun 11 03:10 PM -- 03:15 PM (PDT) @ Room 104
Importance Sampling Policy Evaluation with an Estimated Behavior Policy
Josiah Hanna · Scott Niekum · Peter Stone
[ Slides [ Video
Oral
Tue Jun 11 03:15 PM -- 03:20 PM (PDT) @ Room 104
Learning from a Learner
alexis jacq · Matthieu Geist · Ana Paiva · Olivier Pietquin
[ Slides [ Video