firstbacksecondback
64 Results
Spotlight
|
Wed 11:40 |
Short-Term Plasticity Neurons Learning to Learn and Forget Hector Garcia Rodriguez · Qinghai Guo · Timoleon Moraitis |
|
Poster
|
Tue 15:30 |
Greedy when Sure and Conservative when Uncertain about the Opponents Haobo Fu · Ye Tian · Hongxiang Yu · Weiming Liu · Shuang Wu · Jiechao Xiong · Ying Wen · Kai Li · Junliang Xing · Qiang Fu · Wei Yang |
|
Poster
|
Wed 15:30 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Poster
|
Tue 15:30 |
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou |
|
Poster
|
Thu 15:00 |
Sample-Efficient Reinforcement Learning with loglog(T) Switching Cost Dan Qiao · Ming Yin · Ming Min · Yu-Xiang Wang |
|
Poster
|
Thu 15:00 |
Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path Haoyuan Cai · Tengyu Ma · Simon Du |
|
Poster
|
Thu 15:00 |
Making Linear MDPs Practical via Contrastive Representation Learning Tianjun Zhang · Tongzheng Ren · Mengjiao Yang · Joseph E Gonzalez · Dale Schuurmans · Bo Dai |
|
Poster
|
Tue 15:30 |
Dynamic Regret of Online Markov Decision Processes Peng Zhao · Long-Fei Li · Zhi-Hua Zhou |
|
Poster
|
Tue 15:30 |
Efficient Model-based Multi-agent Reinforcement Learning via Optimistic Equilibrium Computation Pier Giuseppe Sessa · Maryam Kamgarpour · Andreas Krause |
|
Poster
|
Tue 15:30 |
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints Liyu Chen · Rahul Jain · Haipeng Luo |
|
Spotlight
|
Wed 8:00 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Poster
|
Thu 15:00 |
Model Selection in Batch Policy Optimization Jonathan Lee · George Tucker · Ofir Nachum · Bo Dai |