Poster

MERIT: Maximum-normalized Element-wise Ratio for Language Model Large-batch Training

Yang Luo · Zangwei Zheng · Ziheng Qin · Zirui Zhu · Yong Liu · Yang You
2025 Poster

