firstbacksecondback
13 Results
Workshop
|
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training Hong Liu · Zhiyuan Li · David Hall · Percy Liang · Tengyu Ma |