Spotlight | Thu 12:50 | Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Scott Fujimoto · David Meger · Doina Precup · Ofir Nachum · Shixiang Gu
Spotlight | Thu 11:25 | Fast Population-Based Reinforcement Learning on a Single Machine | Arthur Flajolet · Claire Bizon Monroc · Karim Beguir · Thomas Pierrot
Spotlight | Thu 11:00 | Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning | Haoqi Yuan · Zongqing Lu
Spotlight | Thu 13:00 | Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Adam Villaflor · Zhe Huang · Swapnil Pande · John Dolan · Jeff Schneider
Poster | Thu 15:00 | Provably Efficient Offline Reinforcement Learning for Partially Observable Markov Decision Processes | Hongyi Guo · Qi Cai · Yufeng Zhang · Zhuoran Yang · Zhaoran Wang
Spotlight | Thu 11:50 | Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) | Huang Bojun
Poster | Thu 15:00 | Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Laixi Shi · Gen Li · Yuting Wei · Yuxin Chen · Yuejie Chi
Poster | Thu 15:00 | Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Scott Fujimoto · David Meger · Doina Precup · Ofir Nachum · Shixiang Gu
Poster | Thu 15:00 | Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Adam Villaflor · Zhe Huang · Swapnil Pande · John Dolan · Jeff Schneider
Poster | Thu 15:00 | Fast Population-Based Reinforcement Learning on a Single Machine | Arthur Flajolet · Claire Bizon Monroc · Karim Beguir · Thomas Pierrot
Poster | Thu 15:00 | Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning | Haoqi Yuan · Zongqing Lu
Poster | Thu 15:00 | Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) | Huang Bojun