604 Results
Type | Time | Title | Authors
Workshop | Sat 15:00 | Bad-Policy Density: A Measure of Reinforcement-Learning Hardness | David Abel · Cameron Allen · Dilip Arumugam · D Ellis Hershkowitz · Michael L. Littman · Lawson Wong
Workshop | | Bad-Policy Density: A Measure of Reinforcement-Learning Hardness | David Abel · Cameron Allen · Dilip Arumugam · D Ellis Hershkowitz · Michael L. Littman · Lawson Wong
Workshop | | Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning | Haoran Xu · Xianyuan Zhan · Xiangyu Zhu
Spotlight | Wed 7:15 | Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning | Gen Li · Changxiao Cai · Yuxin Chen · Yuantao Gu · Yuting Wei · Yuejie Chi
Spotlight | Tue 7:35 | EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Seyed Kamyar Seyed Ghasemipour · Dale Schuurmans · Shixiang Gu
Poster | Wed 9:00 | Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning | Gen Li · Changxiao Cai · Yuxin Chen · Yuantao Gu · Yuting Wei · Yuejie Chi
Spotlight | Wed 6:25 | Ensemble Bootstrapping for Q-Learning | Oren Peer · Chen Tessler · Nadav Merlis · Ron Meir
Poster | Tue 9:00 | EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Seyed Kamyar Seyed Ghasemipour · Dale Schuurmans · Shixiang Gu
Poster | Wed 9:00 | Ensemble Bootstrapping for Q-Learning | Oren Peer · Chen Tessler · Nadav Merlis · Ron Meir
Oral | Tue 17:00 | PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning | Angelos Filos · Clare Lyle · Yarin Gal · Sergey Levine · Natasha Jaques · Gregory Farquhar
Poster | Tue 21:00 | PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning | Angelos Filos · Clare Lyle · Yarin Gal · Sergey Levine · Natasha Jaques · Gregory Farquhar
Oral Session | Tue 5:00 | Deep Reinforcement Learning 2 |