firstbacksecondback
83 Results
Oral
|
Wed 8:10 |
Learning Markov Games with Adversarial Opponents: Efficient Algorithms and Fundamental Limits Qinghua Liu · Yuanhao Wang · Chi Jin |
|
Poster
|
Wed 15:30 |
The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces Chi Jin · Qinghua Liu · Tiancheng Yu |
|
Spotlight
|
Tue 13:35 |
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist |
|
Poster
|
Wed 15:30 |
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown |
|
Workshop
|
An Adaptive Entropy-Regularization Framework for Multi-Agent Reinforcement Learning WOOJUN KIM · Youngchul Sung |
||
Spotlight
|
Thu 8:25 |
COLA: Consistent Learning with Opponent-Learning Awareness Timon Willi · Alistair Letcher · Johannes Treutlein · Jakob Foerster |
|
Workshop
|
The StarCraft Multi-Agent Challenges+ : Learning of Sub-tasks and Environmental Benefits without Precise Reward Functions (Poster) Mingyu Kim |
||
Workshop
|
Sat 12:10 |
The StarCraft Multi-Agent Challenges+ : Learning of Sub-tasks and Environmental Benefits without Precise Reward Functions Mingyu Kim |
|
Spotlight
|
Tue 14:10 |
Individual Reward Assisted Multi-Agent Reinforcement Learning Li Wang · Yupeng Zhang · Yujing Hu · Weixun Wang · Chongjie Zhang · Yang Gao · Jianye Hao · Tangjie Lv · Changjie Fan |
|
Poster
|
Tue 15:30 |
Scalable Deep Reinforcement Learning Algorithms for Mean Field Games Mathieu Lauriere · Sarah Perrin · Sertan Girgin · Paul Muller · Ayush Jain · Theophile Cabannes · Georgios Piliouras · Julien Perolat · Romuald Elie · Olivier Pietquin · Matthieu Geist |
|
Poster
|
Thu 15:00 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |
|
Oral
|
Thu 13:05 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |