ICML 2022

Poster

Thu 15:00

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Daniil Tiapkin · Denis Belomestny · Eric Moulines · Alexey Naumov · Sergey Samsonov · Yunhao Tang · Michal Valko · Pierre Menard

Poster

Wed 15:30

Improved Regret for Differentially Private Exploration in Linear MDP
Dung Ngo · Giuseppe Vietri · Steven Wu

Oral

Thu 12:30

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Daniil Tiapkin · Denis Belomestny · Eric Moulines · Alexey Naumov · Sergey Samsonov · Yunhao Tang · Michal Valko · Pierre Menard

Spotlight

Thu 13:40

Retrieval-Augmented Reinforcement Learning
Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell

Poster

Tue 15:30

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games
Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist

Poster

Tue 15:30

The Primacy Bias in Deep Reinforcement Learning
Evgenii Nikishin · Max Schwarzer · Pierluca D'Oro · Pierre-Luc Bacon · Aaron Courville

Poster

Tue 15:30

Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation
Pier Giuseppe Sessa · Maryam Kamgarpour · Andreas Krause

Poster

Thu 15:00

Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path
Haoyuan Cai · Tengyu Ma · Simon Du

Spotlight

Wed 13:40

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime
James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch

Workshop

Non-Markovian Policies for Unsupervised Reinforcement Learning in Multiple Environments
Pietro Maldini · Mirco Mutti · Riccardo De Santi · Marcello Restelli

Poster

Thu 15:00

Retrieval-Augmented Reinforcement Learning
Anirudh Goyal · Abe Friesen Friesen · Andrea Banino · Theophane Weber · Nan Rosemary Ke · Adrià Puigdomenech Badia · Arthur Guez · Mehdi Mirza · Peter Humphreys · Ksenia Konyushkova · Michal Valko · Simon Osindero · Timothy Lillicrap · Nicolas Heess · Charles Blundell

Spotlight

Thu 11:45

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang

Main Navigation

356 Results