Poster

MERIT: Maximum-normalized Element-wise Ratio for Language Model Large-batch Training

Yang Luo ⋅ Zangwei Zheng ⋅ Ziheng Qin ⋅ Zirui Zhu ⋅ Yong Liu ⋅ Yang You
2025 Poster

