Skip to yearly menu bar Skip to main content


Poster

Mining Tensor/Neuron-Level Sparsity to Maximize Mixture-of-Experts Potential in Post-Training and Inference

Weilin Cai ⋅ Le Qin ⋅ Shwai He ⋅ Junwei Cui ⋅ Ang Li ⋅ Jiayi Huang

Abstract

Log in and register to view live content