Spotlight
|
Tue 14:40
|
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis
Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou
|
|
Poster
|
Tue 15:30
|
Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis
Ziyi Chen · Yi Zhou · Rong-Rong Chen · Shaofeng Zou
|
|
Spotlight
|
Wed 10:35
|
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
|
|
Poster
|
Wed 15:30
|
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
|
|
Spotlight
|
Wed 8:40
|
Zero-Shot Reward Specification via Grounded Natural Language
Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell
|
|
Poster
|
Wed 15:30
|
Zero-Shot Reward Specification via Grounded Natural Language
Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell
|
|
Spotlight
|
Wed 11:15
|
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
Qi Cai · Zhuoran Yang · Zhaoran Wang
|
|
Poster
|
Wed 15:30
|
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency
Qi Cai · Zhuoran Yang · Zhaoran Wang
|
|
Spotlight
|
Thu 11:40
|
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning
Shuang Ao · Tianyi Zhou · Jing Jiang · Guodong Long · Xuan Song · Chengqi Zhang
|
|
Poster
|
Thu 15:00
|
EAT-C: Environment-Adversarial sub-Task Curriculum for Efficient Reinforcement Learning
Shuang Ao · Tianyi Zhou · Jing Jiang · Guodong Long · Xuan Song · Chengqi Zhang
|
|
Spotlight
|
Tue 8:20
|
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints
Liyu Chen · Rahul Jain · Haipeng Luo
|
|
Poster
|
Tue 15:30
|
Learning Infinite-horizon Average-reward Markov Decision Process with Constraints
Liyu Chen · Rahul Jain · Haipeng Luo
|
|