

Is Human-Written Data Enough? The Challenge of Teaching Reasoning to LLMs Without RL or Distillation

Wei Du ⋅ Branislav Kisacanin ⋅ George Armstrong ⋅ Shubham Toshniwal ⋅ Ivan Moshkov ⋅ Alexan Ayrapetyan ⋅ Sadegh Mahdavi ⋅ Dan Zhao ⋅ Shizhe Diao ⋅ Dragan Mašulović ⋅ Advaith Avadhanam ⋅ Max Wang ⋅ Shitij Govil ⋅ Sri Yanamandra ⋅ Mihir Tandon ⋅ Sriram Ananthakrishnan ⋅ Vedant Rathi ⋅ David Zhang ⋅ Joonseok Kang ⋅ Leon Luo ⋅ Titu Andreescu ⋅ Ashmit Dutta ⋅ Boris Ginsburg ⋅ Igor Gitman

Abstract
