firstbacksecondback
44 Results
Spotlight
|
Wed 11:05 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Spotlight
|
Thu 11:55 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Spotlight
|
Tue 10:55 |
Policy Gradient Method For Robust Reinforcement Learning Yue Wang · Shaofeng Zou |
|
Poster
|
Wed 15:30 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Poster
|
Thu 15:00 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Poster
|
Tue 15:30 |
Policy Gradient Method For Robust Reinforcement Learning Yue Wang · Shaofeng Zou |
|
Spotlight
|
Wed 14:45 |
Distributionally Robust -Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |
|
Poster
|
Wed 15:30 |
Distributionally Robust -Learning Zijian Liu · Jerry Bai · Jose Blanchet · Perry Dong · Wei Xu · Zhengqing Zhou · Zhengyuan Zhou |
|
Spotlight
|
Wed 11:15 |
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency Qi Cai · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Wed 14:40 |
A Natural Actor-Critic Framework for Zero-Sum Markov Games Ahmet Alacaoglu · Luca Viano · Niao He · Volkan Cevher |
|
Spotlight
|
Thu 11:40 |
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning Shuang Ao · Tianyi Zhou · Jing Jiang · Guodong Long · Xuan Song · Chengqi Zhang |
|
Spotlight
|
Wed 11:20 |
Actor-Critic based Improper Reinforcement Learning Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor |