

Oral presentation in Workshop: Methods and Opportunities at Small Scale (MOSS)

Do Larger Language Models Imply Better Generalization? A Pretraining Scaling Law for Implicit Reasoning

Xinyi Wang ⋅ Shawn Tan ⋅ Mingyu Jin ⋅ William Wang ⋅ Rameswar Panda ⋅ Yikang Shen
2025
