(13 events)   Timezone: »  
Show all »
Toggle Poster Visibility
Spotlight
Tue Jul 19 07:30 AM -- 07:35 AM (PDT) @ Hall F
Dynamic Regret of Online Markov Decision Processes
Peng Zhao · Long-Fei Li · Zhi-Hua Zhou
[ Slides
Spotlight
Tue Jul 19 07:35 AM -- 07:40 AM (PDT) @ Hall F
On the Impossibility of Learning to Cooperate with Adaptive Partner Strategies in Repeated Games
Robert Loftin · Frans Oliehoek
[ Slides
Spotlight
Tue Jul 19 07:40 AM -- 07:45 AM (PDT) @ Hall F
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning
Harley Wiltzer · David Meger · Marc Bellemare
[ Slides
Spotlight
Tue Jul 19 07:45 AM -- 07:50 AM (PDT) @ Hall F
Provable Reinforcement Learning with a Short-Term Memory
Yonathan Efroni · Chi Jin · Akshay Krishnamurthy · Sobhan Miryoosefi
Spotlight
Tue Jul 19 07:50 AM -- 07:55 AM (PDT) @ Hall F
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer
Lucas N. Alegre · Ana Lucia Cetertich Bazzan · Bruno C. da Silva
Spotlight
Tue Jul 19 07:55 AM -- 08:00 AM (PDT) @ Hall F
Mirror Learning: A Unifying Framework of Policy Optimisation
Jakub Grudzien Kuba · Christian Schroeder de Witt · Jakob Foerster
[ Slides
Oral
Tue Jul 19 08:00 AM -- 08:20 AM (PDT) @ Hall F
Improved No-Regret Algorithms for Stochastic Shortest Path with Linear MDP
Liyu Chen · Rahul Jain · Haipeng Luo
[ Slides
Spotlight
Tue Jul 19 08:20 AM -- 08:25 AM (PDT) @ Hall F
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints
Liyu Chen · Rahul Jain · Haipeng Luo
[ Slides
Spotlight
Tue Jul 19 08:25 AM -- 08:30 AM (PDT) @ Hall F
A State-Distribution Matching Approach to Non-Episodic Reinforcement Learning
Archit Sharma · Rehaan Ahmad · Chelsea Finn
[ Slides
Spotlight
Tue Jul 19 08:30 AM -- 08:35 AM (PDT) @ Hall F
Langevin Monte Carlo for Contextual Bandits
Pan Xu · Hongkai Zheng · Eric Mazumdar · Kamyar Azizzadenesheli · Animashree Anandkumar
[ Slides
Spotlight
Tue Jul 19 08:35 AM -- 08:40 AM (PDT) @ Hall F
Prompting Decision Transformer for Few-Shot Policy Generalization
Mengdi Xu · Yikang Shen · Shun Zhang · Yuchen Lu · Ding Zhao · Josh Tenenbaum · Chuang Gan
[ Slides
Spotlight
Tue Jul 19 08:40 AM -- 08:45 AM (PDT) @ Hall F
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Shuang Qiu · Lingxiao Wang · Chenjia Bai · Zhuoran Yang · Zhaoran Wang
[ Slides
Spotlight
Tue Jul 19 08:45 AM -- 08:50 AM (PDT) @ Hall F
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
Xiaoyu Chen · Han Zhong · Zhuoran Yang · Zhaoran Wang · Liwei Wang
[ Slides