firstbacksecondback
356 Results
Spotlight
|
Wed 14:35 |
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods Yi Wan · Ali Rahimi-Kalahroudi · Janarthanan Rajendran · Ida Momennejad · Sarath Chandar · Harm van Seijen |
|
Poster
|
Wed 15:30 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Poster
|
Thu 15:00 |
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang |
|
Poster
|
Wed 15:30 |
Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods Yi Wan · Ali Rahimi-Kalahroudi · Janarthanan Rajendran · Ida Momennejad · Sarath Chandar · Harm van Seijen |
|
Spotlight
|
Wed 8:00 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |
|
Oral
|
Tue 10:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |
|
Poster
|
Tue 15:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |
|
Poster
|
Wed 15:30 |
History Compression via Language Models in Reinforcement Learning Fabian Paischer · Thomas Adler · Vihang Patil · Angela Bitto-Nemling · Markus Holzleitner · Sebastian Lehner · Hamid Eghbal-zadeh · Sepp Hochreiter |