Poster

The Quality-Utility Paradox: Why High-Reward Data Impairs Small Model Reasoning

Haolong Qian ⋅ Xianliang Yang ⋅ Ma yinuo ⋅ Lirong Che ⋅ Feng Lu ⋅ Ye Guo ⋅ Lei Song ⋅ Jiang Bian ⋅ Chun Yuan

Abstract
