Oral
Tue Jun 11th 11:00 -- 11:20 AM @ Hall B
ELF OpenGo: an analysis and open reimplementation of AlphaZero
Yuandong Tian · Jerry Ma · Qucheng Gong · Shubho Sengupta · Zhuoyuan Chen · James Pinkerton · Larry Zitnick
Oral
Tue Jun 11th 11:20 -- 11:25 AM @ Hall B
Making Deep Q-learning methods robust to time discretization
Corentin Tallec · Leonard Blier · Yann Ollivier
Oral
Tue Jun 11th 11:25 -- 11:30 AM @ Hall B
Nonlinear Distributional Gradient Temporal-Difference Learning
Chao Qu · Shie Mannor · Huan Xu
Oral
Tue Jun 11th 11:30 -- 11:35 AM @ Hall B
Composing Entropic Policies using Divergence Correction
Jonathan Hunt · Andre Barreto · Timothy Lillicrap · Nicolas Heess
Oral
Tue Jun 11th 11:35 -- 11:40 AM @ Hall B
TibGM: A Transferable and Information-Based Graphical Model Approach for Reinforcement Learning
Tameem Adel · Adrian Weller
Oral
Tue Jun 11th 11:40 AM -- 12:00 PM @ Hall B
Multi-Agent Adversarial Inverse Reinforcement Learning
Lantao Yu · Jiaming Song · Stefano Ermon
Oral
Tue Jun 11th 12:00 -- 12:05 PM @ Hall B
Policy Consolidation for Continual Reinforcement Learning
Christos Kaplanis · Murray Shanahan · Claudia Clopath
Oral
Tue Jun 11th 12:05 -- 12:10 PM @ Hall B
Off-Policy Deep Reinforcement Learning without Exploration
Scott Fujimoto · David Meger · Doina Precup
Oral
Tue Jun 11th 12:10 -- 12:15 PM @ Hall B
Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
Ruohan Wang · Carlo Ciliberto · Pierluigi Vito Amadori · Yiannis Demiris
Oral
Tue Jun 11th 12:15 -- 12:20 PM @ Hall B
Revisiting the Softmax Bellman Operator: New Benefits and New Perspective
Zhao Song · Ron Parr · Lawrence Carin