firstbacksecondback
91 Results
Spotlight
|
Tue 11:45 |
Supervised Off-Policy Ranking Yue Jin · Yue Zhang · Tao Qin · Xudong Zhang · Jian Yuan · Houqiang Li · Tie-Yan Liu |
|
Poster
|
Wed 15:30 |
PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros |
|
Spotlight
|
Thu 11:05 |
Off-Policy Reinforcement Learning with Delayed Rewards Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng |
|
Poster
|
Tue 15:30 |
Supervised Off-Policy Ranking Yue Jin · Yue Zhang · Tao Qin · Xudong Zhang · Jian Yuan · Houqiang Li · Tie-Yan Liu |
|
Poster
|
Thu 15:00 |
Off-Policy Reinforcement Learning with Delayed Rewards Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng |
|
Spotlight
|
Thu 11:45 |
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang |
|
Poster
|
Thu 15:00 |
Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang |