firstbacksecondback
44 Results
Spotlight
|
Thu 11:50 |
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) Huang Bojun |
|
Poster
|
Thu 15:00 |
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) Huang Bojun |
|
Oral
|
Thu 7:30 |
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling sajad khodadadian · PRANAY SHARMA · Gauri Joshi · Siva Maguluri |
|
Poster
|
Thu 15:00 |
Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling sajad khodadadian · PRANAY SHARMA · Gauri Joshi · Siva Maguluri |
|
Spotlight
|
Wed 13:40 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Poster
|
Wed 15:30 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Poster
|
Tue 15:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |
|
Oral
|
Tue 10:30 |
Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution Vihang Patil · Markus Hofmarcher · Marius-Constantin Dinu · Matthias Dorfer · Patrick Blies · Johannes Brandstetter · Jose A. Arjona-Medina · Sepp Hochreiter |