firstbacksecondback
44 Results
Spotlight
|
Thu 11:00 |
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian |
|
Poster
|
Thu 15:00 |
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian |
|
Spotlight
|
Wed 14:40 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Wed 11:20 |
Actor-Critic based Improper Reinforcement Learning Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor |
|
Poster
|
Wed 15:30 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Poster
|
Wed 15:30 |
Actor-Critic based Improper Reinforcement Learning Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor |
|
Poster
|
Wed 15:30 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Spotlight
|
Wed 13:40 |
Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime James-Michael Leahy · Bekzhan Kerimkulov · David Siska · Lukasz Szpruch |
|
Spotlight
|
Wed 14:45 |
Reachability Constrained Reinforcement Learning Dongjie Yu · Haitong Ma · Shengbo Li · Jianyu Chen |
|
Poster
|
Wed 15:30 |
Reachability Constrained Reinforcement Learning Dongjie Yu · Haitong Ma · Shengbo Li · Jianyu Chen |
|
Spotlight
|
Wed 7:50 |
Generalized Data Distribution Iteration Jiajun Fan · Changnan Xiao |
|
Poster
|
Wed 15:30 |
Generalized Data Distribution Iteration Jiajun Fan · Changnan Xiao |