firstbacksecondback
356 Results
Spotlight
|
Tue 11:50 |
The Primacy Bias in Deep Reinforcement Learning Evgenii Nikishin · Max Schwarzer · Pierluca D'Oro · Pierre-Luc Bacon · Aaron Courville |
|
Workshop
|
On the Importance of Critical Period in Multi-stage Reinforcement Learning Junseok Park · Inwoo Hwang · Min Whoo Lee · Hyunseok Oh · Minsu Lee · Youngki Lee · Byoung-Tak Zhang |
||
Spotlight
|
Wed 13:30 |
Improved Regret for Differentially Private Exploration in Linear MDP Dung Ngo · Giuseppe Vietri · Steven Wu |
|
Poster
|
Wed 15:30 |
On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation Xiaohong Chen · Zhengling Qi |
|
Workshop
|
Recursive History Representations for Unsupervised Reinforcement Learning in Multiple-Environments Mirco Mutti · Pietro Maldini · Riccardo De Santi · Marcello Restelli |
||
Spotlight
|
Tue 13:35 |
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist |
|
Poster
|
Wed 15:30 |
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yu Bai · Chi Jin · Song Mei · Tiancheng Yu |
|
Poster
|
Wed 15:30 |
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes Chengchun Shi · Masatoshi Uehara · Jiawei Huang · Nan Jiang |
|
Spotlight
|
Tue 13:25 |
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation Pier Giuseppe Sessa · Maryam Kamgarpour · Andreas Krause |
|
Poster
|
Tue 15:30 |
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration Pengyi Li · Hongyao Tang · Tianpei Yang · Xiaotian Hao · Tong Sang · Yan Zheng · Jianye Hao · Matthew Taylor · Wenyuan Tao · Zhen Wang |
|
Oral
|
Wed 7:30 |
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes Chengchun Shi · Masatoshi Uehara · Jiawei Huang · Nan Jiang |
|
Spotlight
|
Thu 8:55 |
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path Haoyuan Cai · Tengyu Ma · Simon Du |