Skip to yearly menu bar Skip to main content


(10 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Tue Jun 11 11:00 AM -- 11:20 AM (PDT) @ Hall B
ELF OpenGo: an analysis and open reimplementation of AlphaZero
Yuandong Tian · Jerry Ma · Qucheng Gong · Shubho Sengupta · Zhuoyuan Chen · James Pinkerton · Larry Zitnick
[ Video
Oral
Tue Jun 11 11:20 AM -- 11:25 AM (PDT) @ Hall B
Making Deep Q-learning methods robust to time discretization
Corentin Tallec · Leonard Blier · Yann Ollivier
[ Slides [ Video
Oral
Tue Jun 11 11:25 AM -- 11:30 AM (PDT) @ Hall B
Nonlinear Distributional Gradient Temporal-Difference Learning
chao qu · Shie Mannor · Huan Xu
[ Slides [ Video
Oral
Tue Jun 11 11:30 AM -- 11:35 AM (PDT) @ Hall B
Composing Entropic Policies using Divergence Correction
Jonathan Hunt · Andre Barreto · Timothy Lillicrap · Nicolas Heess
[ Slides [ Video
Oral
Tue Jun 11 11:35 AM -- 11:40 AM (PDT) @ Hall B
TibGM: A Transferable and Information-Based Graphical Model Approach for Reinforcement Learning
Tameem Adel · Adrian Weller
[ Slides [ Video
Oral
Tue Jun 11 11:40 AM -- 12:00 PM (PDT) @ Hall B
Multi-Agent Adversarial Inverse Reinforcement Learning
Lantao Yu · Jiaming Song · Stefano Ermon
[ Slides [ Video
Oral
Tue Jun 11 12:00 PM -- 12:05 PM (PDT) @ Hall B
Policy Consolidation for Continual Reinforcement Learning
Christos Kaplanis · Murray Shanahan · Claudia Clopath
[ Slides [ Video
Oral
Tue Jun 11 12:05 PM -- 12:10 PM (PDT) @ Hall B
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto · David Meger · Doina Precup
[ Slides [ Video
Oral
Tue Jun 11 12:10 PM -- 12:15 PM (PDT) @ Hall B
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang · Carlo Ciliberto · Pierluigi Vito Amadori · Yiannis Demiris
[ Slides [ Video
Oral
Tue Jun 11 12:15 PM -- 12:20 PM (PDT) @ Hall B
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
Zhao Song · Ron Parr · Lawrence Carin
[ Slides [ Video