firstbacksecondback
100 Results
Spotlight
|
Thu 8:50 |
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification Ling Pan · Longbo Huang · Tengyu Ma · Huazhe Xu |
|
Poster
|
Thu 15:00 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Spotlight
|
Tue 14:40 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |
|
Spotlight
|
Wed 8:50 |
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation Chris Dann · Yishay Mansour · Mehryar Mohri · Ayush Sekhari · Karthik Sridharan |
|
Spotlight
|
Thu 7:55 |
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization Minghuan Liu · Zhengbang Zhu · Yuzheng Zhuang · Weinan Zhang · Jianye Hao · Yong Yu · Jun Wang |
|
Poster
|
Thu 15:00 |
Branching Reinforcement Learning Yihan Du · Wei Chen |
|
Spotlight
|
Wed 14:45 |
Distributionally Robust -Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |
|
Spotlight
|
Thu 8:30 |
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost Dan Qiao · Ming Yin · Ming Min · Yu-Xiang Wang |
|
Oral
|
Thu 7:30 |
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling sajad khodadadian · PRANAY SHARMA · Gauri Joshi · Siva Maguluri |
|
Spotlight
|
Wed 8:45 |
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yu Bai · Chi Jin · Song Mei · Tiancheng Yu |
|
Spotlight
|
Thu 11:55 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Spotlight
|
Tue 11:45 |
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor |