Skip to yearly menu bar Skip to main content


Poster

GEMQ: Global Expert-Level Mixed-Precision Quantization for MoE LLMs

Jianing Deng ⋅ Song Wang ⋅ Dongwei Wang ⋅ Zijie Liu ⋅ Tianlong Chen ⋅ Huanrui Yang ⋅ Jingtong Hu

Abstract

Log in and register to view live content