Spotlight | Wed 10:35 | PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
Poster | Wed 15:30 | PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation | Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
Spotlight | Thu 11:00 | Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach | Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian
Poster | Thu 15:00 | Understanding Policy Gradient Algorithms: A Sensitivity-Based Approach | Shuang Wu · Ling Shi · Jun Wang · Guangjian Tian
Spotlight | Wed 10:20 | Analysis of Stochastic Processes through Replay Buffers | Shirli Di-Castro Shashua · Shie Mannor · Dotan Di Castro
Poster | Wed 15:30 | Analysis of Stochastic Processes through Replay Buffers | Shirli Di-Castro Shashua · Shie Mannor · Dotan Di Castro
Spotlight | Wed 8:40 | Zero-Shot Reward Specification via Grounded Natural Language | Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell
Poster | Wed 15:30 | Zero-Shot Reward Specification via Grounded Natural Language | Parsa Mahmoudieh · Deepak Pathak · Trevor Darrell
Spotlight | Tue 8:20 | Learning Infinite-horizon Average-reward Markov Decision Process with Constraints | Liyu Chen · Rahul Jain · Haipeng Luo
Poster | Tue 15:30 | Learning Infinite-horizon Average-reward Markov Decision Process with Constraints | Liyu Chen · Rahul Jain · Haipeng Luo
Spotlight | Wed 7:50 | Generalized Data Distribution Iteration | Jiajun Fan · Changnan Xiao
Poster | Wed 15:30 | Generalized Data Distribution Iteration | Jiajun Fan · Changnan Xiao