Skip to yearly menu bar Skip to main content


Poster Wed, Jul 16, 2025 • 11:00 AM – 1:30 PM PDT

MoEQuant: Enhancing Quantization for Mixture-of-Experts Large Language Models via Expert-Balanced Sampling and Affinity Guidance

Zhixuan Chen · Xing Hu · Dawei Yang · Zukang Xu · XUCHEN · Zhihang Yuan · Sifan Zhou · JiangyongYu

Abstract

Lay Summary

Video

Chat is not available.