Skip to yearly menu bar Skip to main content


Oral presentation
in
Workshop: Methods and Opportunities at Small Scale (MOSS)

Do Larger Language Models Imply Better Generalization? A Pretraining Scaling Law for Implicit Reasoning

Xinyi Wang · Shawn Tan · Mingyu Jin · William Wang · Rameswar Panda · Yikang Shen
2025 Oral presentation
in
Workshop: Methods and Opportunities at Small Scale (MOSS)

Abstract

Video

Chat is not available.