(10 events)   Timezone: »  
Show all »
Toggle Poster Visibility
Oral
Wed Jun 12 02:00 PM -- 02:20 PM (PDT) @ Hall B
The Natural Language of Actions
Guy Tennenholtz · Shie Mannor
Oral
Wed Jun 12 02:20 PM -- 02:25 PM (PDT) @ Hall B
Control Regularization for Reduced Variance Reinforcement Learning
Richard Cheng · Abhinav Verma · Gabor Orosz · Swarat Chaudhuri · Yisong Yue · Joel Burdick
Oral
Wed Jun 12 02:25 PM -- 02:30 PM (PDT) @ Hall B
On the Generalization Gap in Reparameterizable Reinforcement Learning
Huan Wang · Stephan Zheng · Caiming Xiong · Richard Socher
Oral
Wed Jun 12 02:30 PM -- 02:35 PM (PDT) @ Hall B
Trajectory-Based Off-Policy Deep Reinforcement Learning
Andreas Doerr · Michael Volpp · Marc Toussaint · Sebastian Trimpe · Christian Daniel
Oral
Wed Jun 12 02:35 PM -- 02:40 PM (PDT) @ Hall B
A Deep Reinforcement Learning Perspective on Internet Congestion Control
Nathan Jay · Noga H. Rotman · Brighten Godfrey · Michael Schapira · Aviv Tamar
Oral
Wed Jun 12 02:40 PM -- 03:00 PM (PDT) @ Hall B
Model-Based Active Exploration
Pranav Shyam · Wojciech Jaśkowski · Faustino Gomez
Oral
Wed Jun 12 03:00 PM -- 03:05 PM (PDT) @ Hall B
Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations
Daniel Brown · Wonjoon Goo · Prabhat Nagarajan · Scott Niekum
Oral
Wed Jun 12 03:05 PM -- 03:10 PM (PDT) @ Hall B
Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN
dror freirich · Tzahi Shimkin · Ron Meir · Aviv Tamar
Oral
Wed Jun 12 03:10 PM -- 03:15 PM (PDT) @ Hall B
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs
Jingkai Mao · Jakob Foerster · Tim Rocktäschel · Maruan Al-Shedivat · Gregory Farquhar · Shimon Whiteson
Oral
Wed Jun 12 03:15 PM -- 03:20 PM (PDT) @ Hall B
Remember and Forget for Experience Replay
Guido Novati · Petros Koumoutsakos