Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training

Hong Liu · Zhiyuan Li · David Hall · Percy Liang · Tengyu Ma

Abstract
