44 Results
Spotlight | Tue 10:55 | Policy Gradient Method For Robust Reinforcement Learning | Yue Wang · Shaofeng Zou
Spotlight | Thu 11:55 | On the Role of Discount Factor in Offline Reinforcement Learning | Hao Hu · Yiqin Yang · Qianchuan Zhao · Chongjie Zhang
Poster | Tue 15:30 | Policy Gradient Method For Robust Reinforcement Learning | Yue Wang · Shaofeng Zou
Poster | Thu 15:00 | On the Role of Discount Factor in Offline Reinforcement Learning | Hao Hu · Yiqin Yang · Qianchuan Zhao · Chongjie Zhang
Spotlight | Wed 14:45 | Distributionally Robust Q-Learning | Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou
Poster | Wed 15:30 | Distributionally Robust Q-Learning | Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou
Spotlight | Wed 11:25 | On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs | Yuanzhou Chen · Jiafan He · Quanquan Gu
Poster | Wed 15:30 | On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs | Yuanzhou Chen · Jiafan He · Quanquan Gu
Oral | Thu 7:30 | Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling | Sajad Khodadadian · Pranay Sharma · Gauri Joshi · Siva Maguluri
Poster | Thu 15:00 | Federated Reinforcement Learning: Linear Speedup Under Markovian Sampling | Sajad Khodadadian · Pranay Sharma · Gauri Joshi · Siva Maguluri
Spotlight | Wed 11:05 | A Temporal-Difference Approach to Policy Gradient Estimation | Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood
Poster | Wed 15:30 | A Temporal-Difference Approach to Policy Gradient Estimation | Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood