firstbacksecondback
44 Results
Poster
|
Wed 15:30 |
Generalized Data Distribution Iteration Jiajun Fan · Changnan Xiao |
|
Spotlight
|
Wed 8:40 |
Zero-Shot Reward Specification via Grounded Natural Language Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell |
|
Spotlight
|
Thu 11:00 |
Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian |
|
Spotlight
|
Thu 11:55 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Spotlight
|
Tue 10:55 |
Policy Gradient Method For Robust Reinforcement Learning Yue Wang · Shaofeng Zou |
|
Spotlight
|
Wed 11:05 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Poster
|
Wed 15:30 |
Zero-Shot Reward Specification via Grounded Natural Language Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell |
|
Poster
|
Thu 15:00 |
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning Shuang Ao · Tianyi Zhou · Jing Jiang · Guodong Long · Xuan Song · Chengqi Zhang |
|
Poster
|
Wed 15:30 |
A Temporal-Difference Approach to Policy Gradient Estimation Samuele Tosatto · Andrew Patterson · Martha White · A. Mahmood |
|
Poster
|
Thu 15:00 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Poster
|
Tue 15:30 |
Policy Gradient Method For Robust Reinforcement Learning Yue Wang · Shaofeng Zou |
|
Spotlight
|
Thu 11:40 |
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning Shuang Ao · Tianyi Zhou · Jing Jiang · Guodong Long · Xuan Song · Chengqi Zhang |