Skip to yearly menu bar Skip to main content


Poster

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Siyan Zhao ⋅ Zhihui Xie ⋅ Mengchen Liu ⋅ Jing Huang ⋅ Guan Pang ⋅ Feiyu Chen ⋅ Aditya Grover

Abstract

Log in and register to view live content