firstbacksecondback
32 Results
Poster
|
Wed 8:00 |
Planning to Explore via Self-Supervised World Models Ramanan Sekar · Oleh Rybkin · Kostas Daniilidis · Pieter Abbeel · Danijar Hafner · Deepak Pathak |
|
Poster
|
Thu 12:00 |
Predictive Coding for Locally-Linear Control Rui Shu · Tung Nguyen · Yinlam Chow · Tuan Pham · Khoat Than · Mohammad Ghavamzadeh · Stefano Ermon · Hung Bui |
|
Poster
|
Thu 12:00 |
Monte-Carlo Tree Search as Regularized Policy Optimization Jean-Bastien Grill · Florent Altché · Yunhao Tang · Thomas Hubert · Michal Valko · Ioannis Antonoglou · Remi Munos |
|
Poster
|
Tue 10:00 |
Optimally Solving Two-Agent Decentralized POMDPs Under One-Sided Information Sharing Yuxuan Xie · Jilles Dibangoye · Olivier Buffet |
|
Poster
|
Thu 14:00 |
Multi-Agent Determinantal Q-Learning Yaodong Yang · Ying Wen · Jun Wang · Liheng Chen · Kun Shao · David Mguni · Weinan Zhang |
|
Poster
|
Tue 7:00 |
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning Yaodong Yang · Jianye Hao · Guangyong Chen · Hongyao Tang · Yingfeng Chen · Yujing Hu · Changjie Fan · Zhongyu Wei |
|
Poster
|
Wed 12:00 |
CoMic: Complementary Task Learning & Mimicry for Reusable Skills Leonard Hasenclever · Fabio Pardo · Raia Hadsell · Nicolas Heess · Josh Merel |
|
Poster
|
Wed 5:00 |
The Differentiable Cross-Entropy Method Brandon Amos · Denis Yarats |