Poster Tue, Jul 7, 2026 • 10:30 AM – 12:15 PM KST HALL A

Spurious Rewards: Rethinking Training Signals in RLVR

Rulin Shao ⋅ Stella Li ⋅ Rui Xin ⋅ Scott Geng ⋅ Yiping Wang ⋅ Sewoong Oh ⋅ Simon Du ⋅ Nathan Lambert ⋅ Sewon Min ⋅ Ranjay Krishna ⋅ Yulia Tsvetkov ⋅ Hannaneh Hajishirzi ⋅ Pang Wei Koh ⋅ Luke Zettlemoyer

Abstract
