firstbacksecondback
72 Results
Spotlight
|
Wed 7:35 |
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters Vladislav Kurenkov · Sergey Kolesnikov |
|
Poster
|
Tue 15:30 |
Offline RL Policies Should Be Trained to be Adaptive Dibya Ghosh · Anurag Ajay · Pulkit Agrawal · Sergey Levine |
|
Oral
|
Tue 11:15 |
Offline RL Policies Should Be Trained to be Adaptive Dibya Ghosh · Anurag Ajay · Pulkit Agrawal · Sergey Levine |
|
Spotlight
|
Tue 11:55 |
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning Shentao Yang · Yihao Feng · Shujian Zhang · Mingyuan Zhou |
|
Spotlight
|
Thu 13:00 |
Imitation Learning by Estimating Expertise of Demonstrators Mark Beliaev · Andy Shih · Stefano Ermon · Dorsa Sadigh · Ramtin Pedarsani |
|
Poster
|
Wed 15:30 |
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters Vladislav Kurenkov · Sergey Kolesnikov |
|
Poster
|
Tue 15:30 |
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning Shentao Yang · Yihao Feng · Shujian Zhang · Mingyuan Zhou |
|
Spotlight
|
Thu 11:55 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |
|
Spotlight
|
Thu 13:10 |
Off-Policy Evaluation for Large Action Spaces via Embeddings Yuta Saito · Thorsten Joachims |
|
Poster
|
Thu 15:00 |
Imitation Learning by Estimating Expertise of Demonstrators Mark Beliaev · Andy Shih · Stefano Ermon · Dorsa Sadigh · Ramtin Pedarsani |
|
Poster
|
Thu 15:00 |
Off-Policy Evaluation for Large Action Spaces via Embeddings Yuta Saito · Thorsten Joachims |
|
Poster
|
Thu 15:00 |
On the Role of Discount Factor in Offline Reinforcement Learning Hao Hu · yiqin yang · Qianchuan Zhao · Chongjie Zhang |