firstbacksecondback
75 Results
Spotlight
|
Thu 11:35 |
Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang |
|
Poster
|
Thu 15:00 |
Optimal Estimation of Policy Gradient via Double Fitted Iteration Chengzhuo Ni · Ruiqi Zhang · Xiang Ji · Xuezhou Zhang · Mengdi Wang |
|
Spotlight
|
Thu 8:05 |
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson |
|
Poster
|
Thu 15:00 |
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes Andrew Wagenmaker · Yifang Chen · Max Simchowitz · Simon Du · Kevin Jamieson |
|
Spotlight
|
Thu 8:55 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Poster
|
Thu 15:00 |
Learning from Demonstration: Provably Efficient Adversarial Policy Imitation with Linear Function Approximation ZHIHAN LIU · Yufeng Zhang · Zuyue Fu · Zhuoran Yang · Zhaoran Wang |
|
Spotlight
|
Thu 13:25 |
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach Xuezhou Zhang · Yuda Song · Masatoshi Uehara · Mengdi Wang · Alekh Agarwal · Wen Sun |
|
Poster
|
Thu 15:00 |
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach Xuezhou Zhang · Yuda Song · Masatoshi Uehara · Mengdi Wang · Alekh Agarwal · Wen Sun |
|
Spotlight
|
Thu 13:55 |
Learning Dynamics and Generalization in Deep Reinforcement Learning Clare Lyle · Mark Rowland · Will Dabney · Marta Kwiatkowska · Yarin Gal |
|
Poster
|
Thu 15:00 |
Learning Dynamics and Generalization in Deep Reinforcement Learning Clare Lyle · Mark Rowland · Will Dabney · Marta Kwiatkowska · Yarin Gal |
|
Spotlight
|
Wed 11:25 |
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs Yuanzhou Chen · Jiafan He · Quanquan Gu |
|
Poster
|
Wed 15:30 |
On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs Yuanzhou Chen · Jiafan He · Quanquan Gu |