Search All 2022 Events
 

100 Results

Page 4 of 9
Spotlight
Tue 7:40 Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
Harley Wiltzer · David Meger · Marc Bellemare
Poster
Wed 15:30 A Natural Actor-Critic Framework for Zero-Sum Markov Games
Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher
Spotlight
Wed 7:30 Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Athul Paul Jacob · David Wu · Gabriele Farina · Adam Lerer · Hengyuan Hu · Anton Bakhtin · Jacob Andreas · Noam Brown
Spotlight
Tue 11:45 Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms
Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor
Oral
Thu 10:30 Learning Bellman Complete Representations for Offline Policy Evaluation
Jonathan Chang · Kaiwen Wang · Nathan Kallus · Wen Sun
Spotlight
Thu 10:50 Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning
Nathan Kallus · Xiaojie Mao · Kaiwen Wang · Zhengyuan Zhou
Oral
Thu 7:30 First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach
Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson
Poster
Wed 15:30 Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
Chris Dann · Yishay Mansour · Mehryar Mohri · Ayush Sekhari · Karthik Sridharan
Poster
Thu 15:00 Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
Ling Pan · Longbo Huang · Tengyu Ma · Huazhe Xu
Poster
Thu 15:00 Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling
Sajad Khodadadian · Pranay Sharma · Gauri Joshi · Siva Maguluri
Poster
Tue 15:30 Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis
Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou
Oral
Tue 8:00 Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP
Liyu Chen · Rahul Jain · Haipeng Luo