firstbacksecondback
356 Results
Poster
|
Thu 15:00 |
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses Daniil Tiapkin · Denis Belomestny · Eric Moulines · Alexey Naumov · Sergey Samsonov · Yunhao Tang · Michal Valko · Pierre Menard |
|
Poster
|
Wed 15:30 |
Improved Regret for Differentially Private Exploration in Linear MDP Dung Ngo · Giuseppe Vietri · Steven Wu |
|
Oral
|
Thu 12:30 |
From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses Daniil Tiapkin · Denis Belomestny · Eric Moulines · Alexey Naumov · Sergey Samsonov · Yunhao Tang · Michal Valko · Pierre Menard |
|
Spotlight
|
Thu 13:40 |
Retrieval-Augmented Reinforcement Learning Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell |
|
Poster
|
Tue 15:30 |
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist |
|
Poster
|
Tue 15:30 |
The Primacy Bias in Deep Reinforcement Learning Evgenii Nikishin · Max Schwarzer · Pierluca D'Oro · Pierre-Luc Bacon · Aaron Courville |
|
Poster
|
Tue 15:30 |
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation Pier Giuseppe Sessa · Maryam Kamgarpour · Andreas Krause |
|
Poster
|
Thu 15:00 |
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path Haoyuan Cai · Tengyu Ma · Simon Du |
|
Spotlight
|
Wed 13:40 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Workshop
|
Non-Markovian Policies for Unsupervised Reinforcement Learning in Multiple Environments Pietro Maldini · Mirco Mutti · Riccardo De Santi · Marcello Restelli |
||
Poster
|
Thu 15:00 |
Retrieval-Augmented Reinforcement Learning Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell |
|
Spotlight
|
Thu 11:45 |
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang |