Gluon: Making Muon & Scion Great Again! (Bridging Theory and Practice of LMO-based Optimizers for LLMs)

Artem Riabinin ⋅ Egor Shulgin ⋅ Kaja Gruntkowska ⋅ Peter Richtarik

Abstract
