64 Results
Spotlight | Wed 7:35 | Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Vladislav Kurenkov · Sergey Kolesnikov
Spotlight | Thu 10:55 | Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning | Seyed Kamyar Seyed Ghasemipour · Satoshi Kataoka · Byron David · Daniel Freeman · Shixiang Gu · Igor Mordatch
Spotlight | Thu 13:10 | Off-Policy Evaluation for Large Action Spaces via Embeddings | Yuta Saito · Thorsten Joachims
Spotlight | Wed 11:20 | Actor-Critic based Improper Reinforcement Learning | Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor
Spotlight | Wed 11:25 | On the Sample Complexity of Learning Infinite-horizon Discounted Linear Kernel MDPs | Yuanzhou Chen · Jiafan He · Quanquan Gu
Spotlight | Thu 11:05 | Temporal Difference Learning for Model Predictive Control | Nicklas Hansen · Hao Su · Xiaolong Wang
Poster | Thu 15:00 | Blocks Assemble! Learning to Assemble with Large-Scale Structured Reinforcement Learning | Seyed Kamyar Seyed Ghasemipour · Satoshi Kataoka · Byron David · Daniel Freeman · Shixiang Gu · Igor Mordatch
Poster | Wed 15:30 | Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Vladislav Kurenkov · Sergey Kolesnikov
Spotlight | Thu 13:45 | Thresholded Lasso Bandit | Kaito Ariu · Kenshi Abe · Alexandre Proutiere
Poster | Thu 15:00 | Off-Policy Evaluation for Large Action Spaces via Embeddings | Yuta Saito · Thorsten Joachims
Poster | Wed 15:30 | Actor-Critic based Improper Reinforcement Learning | Mohammadi Zaki · Avi Mohan · Aditya Gopalan · Shie Mannor
Spotlight | Thu 8:45 | Branching Reinforcement Learning | Yihan Du · Wei Chen