firstbacksecondback
100 Results
Oral
|
Thu 11:05 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Spotlight
|
Thu 8:45 |
Branching Reinforcement Learning Yihan Du · Wei Chen |
|
Spotlight
|
Thu 11:10 |
Model Selection in Batch Policy Optimization Jonathan Lee · George Tucker · Ofir Nachum · Bo Dai |
|
Poster
|
Wed 15:30 |
Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity Lin Guan · Sarath Sreedharan · Subbarao Kambhampati |
|
Poster
|
Thu 15:00 |
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents Wenlong Huang · Pieter Abbeel · Deepak Pathak · Igor Mordatch |
|
Poster
|
Thu 15:00 |
Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning Adam Villaflor · Zhe Huang · Swapnil Pande · John Dolan · Jeff Schneider |
|
Poster
|
Wed 15:30 |
Causal Dynamics Learning for Task-Independent State Abstraction Zizhao Wang · Xuesu Xiao · Zifan Xu · Yuke Zhu · Peter Stone |
|
Poster
|
Thu 15:00 |
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path Haoyuan Cai · Tengyu Ma · Simon Du |
|
Poster
|
Thu 15:00 |
Toward Compositional Generalization in Object-Oriented World Modeling Linfeng Zhao · Lingzhi Kong · Robin Walters · Lawson Wong |
|
Poster
|
Thu 15:00 |
Temporal Difference Learning for Model Predictive Control Nicklas Hansen · Hao Su · Xiaolong Wang |
|
Spotlight
|
Wed 14:40 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 14:45 |
Distributionally Robust -Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |