Skip to yearly menu bar Skip to main content


Resolving Discrepancies in Compute-Optimal Scaling of Language Models

Tomer Porian ⋅ Mitchell Wortsman ⋅ Jenia Jitsev ⋅ Ludwig Schmidt ⋅ Yair Carmon

Abstract

Video

Chat is not available.