Skip to yearly menu bar Skip to main content


Poster Wed, Jul 16, 2025 • 11:00 AM – 1:30 PM PDT

MERIT: Maximum-normalized Element-wise Ratio for Language Model Large-batch Training

Yang Luo · Zangwei Zheng · Ziheng Qin · Zirui Zhu · Yong Liu · Yang You

Abstract

Lay Summary

Video

Chat is not available.