firstbacksecondback
100 Results
Spotlight
|
Tue 7:40 |
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning Harley Wiltzer · David Meger · Marc Bellemare |
|
Poster
|
Wed 15:30 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 7:30 |
Modeling Strong and Human-Like Gameplay with KL-Regularized Search Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown |
|
Spotlight
|
Tue 11:45 |
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor |
|
Oral
|
Thu 10:30 |
Learning Bellman Complete Representations for Offline Policy Evaluation Jonathan Chang · Kaiwen Wang · Nathan Kallus · Wen Sun |
|
Spotlight
|
Thu 10:50 |
Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning Nathan Kallus · Xiaojie Mao · Kaiwen Wang · Zhengyuan Zhou |
|
Oral
|
Thu 7:30 |
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson |
|
Poster
|
Wed 15:30 |
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation Chris Dann · Yishay Mansour · Mehryar Mohri · Ayush Sekhari · Karthik Sridharan |
|
Poster
|
Thu 15:00 |
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification Ling Pan · Longbo Huang · Tengyu Ma · Huazhe Xu |
|
Poster
|
Thu 15:00 |
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling sajad khodadadian · PRANAY SHARMA · Gauri Joshi · Siva Maguluri |
|
Poster
|
Tue 15:30 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |
|
Oral
|
Tue 8:00 |
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP Liyu Chen · Rahul Jain · Haipeng Luo |