firstbacksecondback
64 Results
Poster
|
Thu 15:00 |
Constrained Variational Policy Optimization for Safe Reinforcement Learning Zuxin Liu · Zhepeng Cen · Vladislav Isenbaev · Wei Liu · Steven Wu · Bo Li · Ding Zhao |
|
Spotlight
|
Wed 8:45 |
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yu Bai · Chi Jin · Song Mei · Tiancheng Yu |
|
Poster
|
Wed 15:30 |
Denoised MDPs: Learning World Models Better Than the World Itself Tongzhou Wang · Simon Du · Antonio Torralba · Phillip Isola · Amy Zhang · Yuandong Tian |
|
Poster
|
Wed 15:30 |
Short-Term Plasticity Neurons Learning to Learn and Forget Hector Garcia Rodriguez · Qinghai Guo · Timoleon Moraitis |
|
Spotlight
|
Wed 11:15 |
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency Qi Cai · Zhuoran Yang · Zhaoran Wang |
|
Oral
|
Thu 7:30 |
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson |
|
Poster
|
Thu 15:00 |
First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson |
|
Poster
|
Wed 15:30 |
Near-Optimal Learning of Extensive-Form Games with Imperfect Information Yu Bai · Chi Jin · Song Mei · Tiancheng Yu |
|
Spotlight
|
Tue 11:45 |
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor |
|
Poster
|
Wed 15:30 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Poster
|
Wed 15:30 |
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency Qi Cai · Zhuoran Yang · Zhaoran Wang |
|
Oral
|
Tue 8:00 |
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP Liyu Chen · Rahul Jain · Haipeng Luo |