Skip to yearly menu bar Skip to main content


(10 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Tue Jun 11 02:00 PM -- 02:20 PM (PDT) @ Hall B
An Investigation of Model-Free Planning
Arthur Guez · Mehdi Mirza · Karol Gregor · Rishabh Kabra · Sebastien Racaniere · Theophane Weber · David Raposo · Adam Santoro · Laurent Orseau · Tom Eccles · Greg Wayne · David Silver · Timothy Lillicrap
[ Video
Oral
Tue Jun 11 02:20 PM -- 02:25 PM (PDT) @ Hall B
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
Cédric Colas · Pierre-Yves Oudeyer · Olivier Sigaud · Pierre Fournier · Mohamed Chetouani
[ Slides [ Video
Oral
Tue Jun 11 02:25 PM -- 02:30 PM (PDT) @ Hall B
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
Yilun Du · Karthik Narasimhan
[ Slides [ Video
Oral
Tue Jun 11 02:30 PM -- 02:35 PM (PDT) @ Hall B
Diagnosing Bottlenecks in Deep Q-learning Algorithms
Justin Fu · Aviral Kumar · Matthew Soh · Sergey Levine
[ Slides [ Video
Oral
Tue Jun 11 02:35 PM -- 02:40 PM (PDT) @ Hall B
Collaborative Evolutionary Reinforcement Learning
Shauharda Khadka · Somdeb Majumdar · Tarek Nassar · Zach Dwiel · Evren Tumer · Santiago Miret · Yinyin Liu · Kagan Tumer
[ Slides [ Video
Oral
Tue Jun 11 02:40 PM -- 03:00 PM (PDT) @ Hall B
EMI: Exploration with Mutual Information
Hyoungseok Kim · Jaekyeom Kim · Yeonwoo Jeong · Sergey Levine · Hyun Oh Song
[ Video
Oral
Tue Jun 11 03:00 PM -- 03:05 PM (PDT) @ Hall B
Imitation Learning from Imperfect Demonstration
Yueh-Hua Wu · Nontawat Charoenphakdee · Han Bao · Voot Tangkaratt · Masashi Sugiyama
[ Slides [ Video
Oral
Tue Jun 11 03:05 PM -- 03:10 PM (PDT) @ Hall B
Curiosity-Bottleneck: Exploration By Distilling Task-Specific Novelty
Youngjin Kim · Daniel Nam · Hyunwoo Kim · Ji-Hoon Kim · Gunhee Kim
[ Slides [ Video
Oral
Tue Jun 11 03:10 PM -- 03:15 PM (PDT) @ Hall B
Dynamic Weights in Multi-Objective Deep Reinforcement Learning
Axel Abels · Diederik Roijers · Tom Lenaerts · Ann Nowé · Denis Steckelmacher
[ Slides [ Video
Oral
Tue Jun 11 03:15 PM -- 03:20 PM (PDT) @ Hall B
Fingerprint Policy Optimisation for Robust Reinforcement Learning
Supratik Paul · Michael A Osborne · Shimon Whiteson
[ Slides [ Video