Search All 2022 Events
 

91 Results

Spotlight
Tue 11:45 Supervised Off-Policy Ranking
Yue Jin · Yue Zhang · Tao Qin · Xudong Zhang · Jian Yuan · Houqiang Li · Tie-Yan Liu
Poster
Wed 15:30 PAGE-PG: A Simple and Loopless Variance-Reduced Policy Gradient Method with Probabilistic Gradient Estimation
Matilde Gargiani · Andrea Zanelli · Andrea Martinelli · Tyler Summers · John Lygeros
Spotlight
Thu 11:05 Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng
Poster
Tue 15:30 Supervised Off-Policy Ranking
Yue Jin · Yue Zhang · Tao Qin · Xudong Zhang · Jian Yuan · Houqiang Li · Tie-Yan Liu
Poster
Thu 15:00 Off-Policy Reinforcement Learning with Delayed Rewards
Beining Han · Zhizhou Ren · Zuofan Wu · Yuan Zhou · Jian Peng
Spotlight
Thu 11:45 Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang
Poster
Thu 15:00 Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Ruiqi Zhang · Xuezhou Zhang · Chengzhuo Ni · Mengdi Wang