36 Results
Type | Time | Title | Authors
Spotlight | Thu 12:50 | On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces | Amrit Singh Bedi · Souradip Chakraborty · Anjaly Parayil · Brian Sadler · Pratap Tokekar · Alec Koppel
Poster | Thu 15:00 | On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces | Amrit Singh Bedi · Souradip Chakraborty · Anjaly Parayil · Brian Sadler · Pratap Tokekar · Alec Koppel
Spotlight | Thu 13:10 | Off-Policy Evaluation for Large Action Spaces via Embeddings | Yuta Saito · Thorsten Joachims
Spotlight | Wed 14:30 | Reinforcement Learning with Action-Free Pre-Training from Videos | Younggyo Seo · Kimin Lee · Stephen James · Pieter Abbeel
Poster | Thu 15:00 | Off-Policy Evaluation for Large Action Spaces via Embeddings | Yuta Saito · Thorsten Joachims
Poster | Wed 15:30 | Reinforcement Learning with Action-Free Pre-Training from Videos | Younggyo Seo · Kimin Lee · Stephen James · Pieter Abbeel
Spotlight | Tue 7:40 | Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning | Harley Wiltzer · David Meger · Marc Bellemare
Poster | Tue 15:30 | Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning | Harley Wiltzer · David Meger · Marc Bellemare
Spotlight | Wed 14:25 | Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Pengjie Gu · Mengchen Zhao · Chen Chen · Dong Li · Jianye Hao · Bo An
Poster | Wed 15:30 | Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Pengjie Gu · Mengchen Zhao · Chen Chen · Dong Li · Jianye Hao · Bo An
Spotlight | Thu 11:30 | Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Wenlong Huang · Pieter Abbeel · Deepak Pathak · Igor Mordatch
Poster | Thu 15:00 | Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Wenlong Huang · Pieter Abbeel · Deepak Pathak · Igor Mordatch