Poster

Memory-Efficient LLMs Training with Dynamic Sparsity: From Stability to Practical Scaling

Qiao Xiao ⋅ Boqian Wu ⋅ Patrik Okanovic ⋅ Tomasz Sternal ⋅ Maurice Keulen ⋅ Elena Mocanu ⋅ Mykola Pechenizkiy ⋅ Decebal Constantin Mocanu ⋅ Torsten Hoefler

Abstract
