Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models

Zhou Jiecheng ⋅ DING TANG ⋅ Rong Fu ⋅ Boni Hu ⋅ Haoran Xu ⋅ Yi Wang ⋅ zhongling su ⋅ Liang Liu ⋅ PeiZhilin ⋅ Hengjie Li ⋅ Xingcheng ZHANG ⋅ Weiming Zhang

Abstract
