firstbacksecondback
14 Results
Poster
|
Tue 18:30 |
Open-ended learning in symmetric zero-sum games David Balduzzi · Marta Garnelo · Yoram Bachrach · Wojciech Czarnecki · Julien Perolat · Max Jaderberg · Thore Graepel |
|
Poster
|
Tue 18:30 |
Stable-Predictive Optimistic Counterfactual Regret Minimization Gabriele Farina · Christian Kroer · Noam Brown · Tuomas Sandholm |
|
Poster
|
Tue 18:30 |
Learning from a Learner alexis jacq · Matthieu Geist · Ana Paiva · Olivier Pietquin |
|
Poster
|
Tue 18:30 |
Multi-Agent Adversarial Inverse Reinforcement Learning Lantao Yu · Jiaming Song · Stefano Ermon |
|
Poster
|
Thu 18:30 |
Actor-Attention-Critic for Multi-Agent Reinforcement Learning Shariq Iqbal · Fei Sha |
|
Poster
|
Tue 18:30 |
Deep Counterfactual Regret Minimization Noam Brown · Adam Lerer · Sam Gross · Tuomas Sandholm |
|
Poster
|
Wed 18:30 |
A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs Jingkai Mao · Jakob Foerster · Tim Rocktäschel · Maruan Al-Shedivat · Gregory Farquhar · Shimon Whiteson |
|
Poster
|
Thu 18:30 |
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning Kyunghwan Son · Daewoo Kim · Wan Ju Kang · David Earl Hostallero · Yung Yi |
|
Poster
|
Tue 18:30 |
Learning to Collaborate in Markov Decision Processes Goran Radanovic · Rati Devidze · David Parkes · Adish Singla |
|
Poster
|
Tue 18:30 |
Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI Lei Han · Peng Sun · Yali Du · Jiechao Xiong · Qing Wang · Xinghai Sun · Han Liu · Tong Zhang |
|
Poster
|
Thu 18:30 |
TarMAC: Targeted Multi-Agent Communication Abhishek Das · Theophile Gervet · Joshua Romoff · Dhruv Batra · Devi Parikh · Michael Rabbat · Joelle Pineau |
|
Poster
|
Thu 18:30 |
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning Thinh Doan · Siva Maguluri · Justin Romberg |