firstbacksecondback
91 Results
Oral
|
Tue 13:45 |
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence Dongsheng Ding · Chen-Yu Wei · Kaiqing Zhang · Mihailo Jovanovic |
|
Poster
|
Tue 15:30 |
Individual Reward Assisted Multi-Agent Reinforcement Learning Li Wang · Yupeng Zhang · Yujing Hu · Weixun Wang · Chongjie Zhang · Yang Gao · Jianye Hao · Tangjie Lv · Changjie Fan |
|
Poster
|
Wed 15:30 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Oral
|
Thu 11:05 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Poster
|
Thu 15:00 |
Off-Policy Evaluation for Large Action Spaces via Embeddings Yuta Saito · Thorsten Joachims |
|
Poster
|
Thu 15:00 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Poster
|
Tue 15:30 |
Policy Gradient Method For Robust Reinforcement Learning Yue Wang · Shaofeng Zou |
|
Poster
|
Tue 15:30 |
Mirror Learning: A Unifying Framework of Policy Optimisation Jakub Grudzien Kuba · Christian Schroeder de Witt · Jakob Foerster |
|
Poster
|
Thu 15:00 |
Adversarially Trained Actor Critic for Offline Reinforcement Learning Ching-An Cheng · Tengyang Xie · Nan Jiang · Alekh Agarwal |
|
Poster
|
Tue 15:30 |
Independent Policy Gradient for Large-Scale Markov Potential Games: Sharper Rates, Function Approximation, and Game-Agnostic Convergence Dongsheng Ding · Chen-Yu Wei · Kaiqing Zhang · Mihailo Jovanovic |
|
Oral
|
Thu 13:05 |
An Analytical Update Rule for General Policy Optimization Hepeng Li · Nicholas Clavette · Haibo He |
|
Spotlight
|
Tue 11:40 |
Interpretable Off-Policy Learning via Hyperbox Search Daniel Tschernutter · Tobias Hatt · Stefan Feuerriegel |