(4 events)
The 2026 schedule is still incomplete
Oral
Wed Jul 08 10:00 AM -- 10:15 AM (KST)
Mitigating Reward Hacking in RLHF via Bayesian Non-negative Reward Modeling
Zhibin Duan ⋅ Guowei Rong ⋅ Zhuo Li ⋅ Bo Chen ⋅ Mingyuan Zhou ⋅ Dandan Guo
Oral
Wed Jul 08 10:15 AM -- 10:30 AM (KST)
Reinforcement Learning with Evolving Rubrics for Deep Research
Rulin Shao ⋅ Akari Asai ⋅ Shannon Shen ⋅ Hamish Ivison ⋅ Varsha Kishore ⋅ Jingming Zhuo ⋅ Xinran Zhao ⋅ Molly Park ⋅ Samuel Finlayson ⋅ David Sontag ⋅ Tyler Murray ⋅ Sewon Min ⋅ Pradeep Dasigi ⋅ Luca Soldaini ⋅ Faeze Brahman ⋅ Scott Yih ⋅ Sherry Wu ⋅ Luke Zettlemoyer ⋅ Yoon Kim ⋅ Hannaneh Hajishirzi ⋅ Pang Wei Koh
Oral
Wed Jul 08 10:30 AM -- 10:45 AM (KST)
Simultaneous Speech-to-Speech Translation Without Aligned Data
Tom Labiausse ⋅ Romain Fabre ⋅ Yannick Estève ⋅ Alexandre Défossez ⋅ Neil Zeghidour
Oral
Wed Jul 08 10:45 AM -- 11:00 AM (KST)
Video-Based Optimal Transport for Feedback-Efficient Offline Preference-Based Reinforcement Learning
Minh-Tung Luu ⋅ Hwanhee Kim ⋅ Younghwan Lee ⋅ Chang Yoo