Poster

AdaGC: Enhancing LLM Pretraining Stability via Adaptive Gradient Clipping

Guoxia Wang ⋅ Shuai Li ⋅ Congliang Chen ⋅ Jinle Zeng ⋅ Jiabin Yang ⋅ Dianhai Yu ⋅ Yanjun Ma ⋅ Li Shen

Abstract
