Skip to yearly menu bar Skip to main content


Poster

How to Fine-Tune a Reasoning Model? A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data

Zixian Huang ⋅ Kaichen Yang ⋅ Xu Huang ⋅ Feiyang Hao ⋅ Qiming Ge ⋅ Bowen Li ⋅ He Du ⋅ Kai Chen ⋅ Qipeng Guo

Abstract

Log in and register to view live content