Skip to yearly menu bar Skip to main content


TMA-Adaptive FP8 Grouped GEMM: Eliminating Padding Requirements in Low-Precision Training and Inference on Hopper

zhongling su ⋅ Rong Fu ⋅ Weihan Cao ⋅ Jianfei Gao ⋅ Minxi Jin ⋅ PeiZhilin ⋅ Hui Wang

Abstract

Chat is not available.