Spotlight
|
Wed 8:00
|
History Compression via Language Models in Reinforcement Learning
Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter
|
|
Poster
|
Tue 15:30
|
Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms
Jeongyeol Kwon · Yonathan Efroni · Constantine Caramanis · Shie Mannor
|
|
Oral
|
Tue 11:05
|
Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four
Stephan Wäldchen · Sebastian Pokutta · Felix Huber
|
|
Poster
|
Wed 15:30
|
Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models and Amortized Policy Search
Qi Wang · Herke van Hoof
|
|
Poster
|
Thu 15:00
|
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach
Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian
|
|
Spotlight
|
Thu 13:10
|
Off-Policy Evaluation for Large Action Spaces via Embeddings
Yuta Saito · Thorsten Joachims
|
|
Oral
|
Thu 11:15
|
Adversarially Trained Actor Critic for Offline Reinforcement Learning
Ching-An Cheng · Tengyang Xie · Nan Jiang · Alekh Agarwal
|
|
Poster
|
Wed 15:30
|
Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning
Yunhao Tang
|
|
Spotlight
|
Tue 7:55
|
Mirror Learning: A Unifying Framework of Policy Optimisation
Jakub Grudzien Kuba · Christian Schroeder de Witt · Jakob Foerster
|
|
Poster
|
Tue 15:30
|
Training Characteristic Functions with Reinforcement Learning: XAI-methods play Connect Four
Stephan Wäldchen · Sebastian Pokutta · Felix Huber
|
|
Poster
|
Wed 15:30
|
A Temporal-Difference Approach to Policy Gradient Estimation
Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood
|
|
Spotlight
|
Tue 10:55
|
Policy Gradient Method For Robust Reinforcement Learning
Yue Wang · Shaofeng Zou
|
|