firstbacksecondback
77 Results
Poster
|
Wed 15:30 |
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown |
|
Poster
|
Wed 15:30 |
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces Chi Jin · Qinghua Liu · Tiancheng Yu |
|
Spotlight
|
Thu 8:25 |
COLA: Consistent Learning with Opponent-Learning Awareness Timon Willi · Alistair Letcher · Johannes Treutlein · Jakob Foerster |
|
Spotlight
|
Tue 14:10 |
Individual Reward Assisted Multi-Agent Reinforcement Learning Li Wang · Yupeng Zhang · Yujing Hu · Weixun Wang · Chongjie Zhang · Yang Gao · Jianye Hao · Tangjie Lv · Changjie Fan |
|
Poster
|
Tue 15:30 |
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist |
|
Poster
|
Thu 15:00 |
COLA: Consistent Learning with Opponent-Learning Awareness Timon Willi · Alistair Letcher · Johannes Treutlein · Jakob Foerster |
|
Poster
|
Tue 15:30 |
Individual Reward Assisted Multi-Agent Reinforcement Learning Li Wang · Yupeng Zhang · Yujing Hu · Weixun Wang · Chongjie Zhang · Yang Gao · Jianye Hao · Tangjie Lv · Changjie Fan |
|
Spotlight
|
Thu 11:35 |
Evolving Curricula with Regret-Based Environment Design Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel |
|
Spotlight
|
Wed 8:45 |
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yu Bai · Chi Jin · Song Mei · Tiancheng Yu |
|
Spotlight
|
Tue 11:40 |
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration Pengyi Li · Hongyao Tang · Tianpei Yang · Xiaotian Hao · Tong Sang · Yan Zheng · Jianye Hao · Matthew Taylor · Wenyuan Tao · Zhen Wang |
|
Poster
|
Thu 15:00 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |
|
Oral
|
Thu 13:05 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |