firstbacksecondback
32 Results
Poster
|
Tue 8:00 |
Boosting for Control of Dynamical Systems Naman Agarwal · Nataly Brukhim · Elad Hazan · Zhou Lu |
|
Poster
|
Tue 11:00 |
Stochastically Dominant Distributional Reinforcement Learning John Martin · Michal Lyskawinski · Xiaohu Li · Brendan Englot |
|
Poster
|
Wed 11:00 |
“Other-Play” for Zero-Shot Coordination Hengyuan Hu · Alexander Peysakhovich · Adam Lerer · Jakob Foerster |
|
Poster
|
Tue 9:00 |
Variational Imitation Learning with Diverse-quality Demonstrations Voot Tangkaratt · Bo Han · Mohammad Emtiyaz Khan · Masashi Sugiyama |
|
Poster
|
Thu 7:00 |
ConQUR: Mitigating Delusional Bias in Deep Q-Learning DiJia Su · Jayden Ooi · Tyler Lu · Dale Schuurmans · Craig Boutilier |
|
Poster
|
Wed 9:00 |
Flexible and Efficient Long-Range Planning Through Curious Exploration Aidan Curtis · Minjian Xin · Dilip Arumugam · Kevin Feigelis · Daniel Yamins |
|
Poster
|
Tue 11:00 |
Kernel Methods for Cooperative Multi-Agent Contextual Bandits Abhimanyu Dubey · Alex `Sandy' Pentland |
|
Poster
|
Thu 12:00 |
Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains Johannes Fischer · Ömer Sahin Tas |
|
Poster
|
Wed 9:00 |
Cooperative Multi-Agent Bandits with Heavy Tails Abhimanyu Dubey · Alex `Sandy' Pentland |
|
Poster
|
Tue 8:00 |
FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis Aman Sinha · Matthew O'Kelly · Hongrui Zheng · Rahul Mangharam · John Duchi · Russ Tedrake |
|
Poster
|
Wed 9:00 |
Low-Variance and Zero-Variance Baselines for Extensive-Form Games Trevor Davis · Martin Schmid · Michael Bowling |
|
Poster
|
Thu 9:00 |
Symbolic Network: Generalized Neural Policies for Relational MDPs Sankalp Garg · Aniket Bajpai · Mausam |