Workshop
|
Fri 8:00
|
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh · Wesley A. Suttle · Brian Sadler · Vinay Namboodiri · Amrit Singh Bedi
|
|
Workshop
|
|
Unavoidable Learning Constraints Alter the Foundations of Direct Preference Optimization
David Wipf
|
|
Workshop
|
Fri 8:00
|
Uncertainty-aware Preference Alignment in Reinforcement Learning from Human Feedback
Sheng Xu · Bo Yue · Hongyuan Zha · Guiliang Liu
|
|
Workshop
|
Sat 2:20
|
Sarah Dean: Learning preference dynamics from partial observations
Sarah Dean
|
|
Workshop
|
Fri 8:00
|
Comparing Few to Rank Many: Optimal Design for Learning Preferences
Kiran Thekumparampil · Gaurush Hiranandani · Kousha Kalantari · Shoham Sabach · Branislav Kveton
|
|
Poster
|
Thu 2:30
|
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
Utsav Singh · Wesley A. Suttle · Brian Sadler · Vinay Namboodiri · Amrit Singh Bedi
|
|
Poster
|
Thu 4:30
|
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang · Tianqi Chen · Mingyuan Zhou
|
|
Workshop
|
Fri 5:30
|
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning (outstanding paper)
|
|
Workshop
|
Fri 8:00
|
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace · Bernhard Schölkopf · Gunnar Ratsch · Giorgia Ramponi
|
|
Workshop
|
|
Preference Elicitation for Offline Reinforcement Learning
Alizée Pace · Bernhard Schölkopf · Gunnar Ratsch · Giorgia Ramponi
|
|
Workshop
|
Fri 8:00
|
DPM: Dual Preferences-based Multi-Agent Reinforcement Learning
Sehyeok Kang · Yongsik Lee · Se-Young Yun
|
|
Workshop
|
|
Hummer: Towards Limited Competitive Preference Dataset
Li Jiang · Yusen Wu · Junwu Xiong · Jingqing Ruan · Qingpei Guo · zujie wen · JUN ZHOU · Xiaotie Deng
|
|