firstbacksecondback
350 Results
Poster
|
Tue 7:00 |
Provably Efficient Exploration in Policy Optimization Qi Cai · Zhuoran Yang · Chi Jin · Zhaoran Wang |
|
Poster
|
Wed 12:00 |
CoMic: Complementary Task Learning & Mimicry for Reusable Skills Leonard Hasenclever · Fabio Pardo · Raia Hadsell · Nicolas Heess · Josh Merel |
|
Poster
|
Thu 6:00 |
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation Yaqi Duan · Zeyu Jia · Mengdi Wang |
|
Poster
|
Thu 12:00 |
The k-tied Normal Distribution: A Compact Parameterization of Gaussian Mean Field Posteriors in Bayesian Neural Networks Jakub Swiatkowski · Kevin Roth · Bastiaan Veeling · Linh Tran · Joshua V Dillon · Jasper Snoek · Stephan Mandt · Tim Salimans · Rodolphe Jenatton · Sebastian Nowozin |
|
Poster
|
Tue 10:00 |
Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing Yuxuan Xie · Jilles Dibangoye · Olivier Buffet |
|
Workshop
|
Sat 9:10 |
Machine Learning for Media Discovery Erik Schmidt · Oriol Nieto · Fabien Gouyon · Yves Raimond · Katherine Kinnaird · Gert Lanckriet |
|
Poster
|
Thu 7:00 |
Clinician-in-the-Loop Decision Making: Reinforcement Learning with Near-Optimal Set-Valued Policies Shengpu Tang · Aditya Modi · Michael Sjoding · Jenna Wiens |
|
Poster
|
Wed 8:00 |
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism Wang Chi Cheung · David Simchi-Levi · Ruihao Zhu |
|
Poster
|
Tue 18:00 |
Striving for Simplicity and Performance in Off-Policy DRL: Output Normalization and Non-Uniform Sampling Che Wang · Yanqiu Wu · Quan Vuong · Keith Ross |
|
Workshop
|
Fri 6:30 |
Theoretical Foundations of Reinforcement Learning Emma Brunskill · Thodoris Lykouris · Max Simchowitz · Wen Sun · Mengdi Wang |
|
Poster
|
Tue 12:00 |
A distributional view on multi-objective policy optimization Abbas Abdolmaleki · Sandy Huang · Leonard Hasenclever · Michael Neunert · Francis Song · Martina Zambelli · Murilo Martins · Nicolas Heess · Raia Hadsell · Martin Riedmiller |
|
Poster
|
Wed 5:00 |
Stabilizing Transformers for Reinforcement Learning Emilio Parisotto · Francis Song · Jack Rae · Razvan Pascanu · Caglar Gulcehre · Siddhant Jayakumar · Max Jaderberg · Raphael Lopez Kaufman · Aidan Clark · Seb Noury · Matthew Botvinick · Nicolas Heess · Raia Hadsell |