Resolving Discrepancies in Compute-Optimal Scaling of Language Models

Tomer Porian · Mitchell Wortsman · Jenia Jitsev · Ludwig Schmidt · Yair Carmon

Abstract
