Skip to yearly menu bar Skip to main content


Towards Efficient Pre-training: Exploring FP4 Precision in Large Language Models

Zhou Jiecheng · DING TANG · Rong Fu · Boni Hu · Haoran Xu · Yi Wang · zhongling su · Liang Liu · PeiZhilin · Hengjie Li · Xingcheng ZHANG · Weiming Zhang

Abstract

Chat is not available.