58 Results
Poster
|
Thu 15:00 |
Constrained Variational Policy Optimization for Safe Reinforcement Learning Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao |
|
Poster
|
Thu 15:00 |
Winning the Lottery Ahead of Time: Efficient Early Network Pruning John Rachwan · Daniel Zügner · Bertrand Charpentier · Simon Geisler · Morgane Ayle · Stephan Günnemann |
|
Spotlight
|
Thu 11:50 |
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) Huang Bojun |
|
Spotlight
|
Wed 10:40 |
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning Hassam Sheikh · Kizza Nandyose Frisbee · Mariano Phielipp |
|
Spotlight
|
Wed 14:55 |
Goal Misgeneralization in Deep Reinforcement Learning Lauro Langosco di Langosco · Jack Koch · Lee Sharkey · Jacob Pfau · David Krueger |
|
Spotlight
|
Thu 11:40 |
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes Hongyi Guo · Qi Cai · Yufeng Zhang · Zhuoran Yang · Zhaoran Wang |
|
Poster
|
Wed 15:30 |
DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning Hassam Sheikh · Kizza Nandyose Frisbee · Mariano Phielipp |
|
Poster
|
Thu 15:00 |
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) Huang Bojun |
|
Spotlight
|
Thu 11:45 |
Tell me why! Explanations support learning relational and causal structure Andrew Lampinen · Nicholas Roy · Ishita Dasgupta · Stephanie Chan · Allison Tam · James McClelland · Chen Yan · Adam Santoro · Neil Rabinowitz · Jane Wang · Felix Hill |
|
Poster
|
Wed 15:30 |
Goal Misgeneralization in Deep Reinforcement Learning Lauro Langosco di Langosco · Jack Koch · Lee Sharkey · Jacob Pfau · David Krueger |
|
Poster
|
Thu 15:00 |
Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes Hongyi Guo · Qi Cai · Yufeng Zhang · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Wed 8:35 |
A data-driven approach for learning to control computers Peter Humphreys · David Raposo · Tobias Pohlen · Gregory Thornton · Rachita Chhaparia · Alistair Muldal · Josh Abramson · Petko Georgiev · Adam Santoro · Timothy Lillicrap |