

Poster

Schur-A*: Layer-wise Optimal Expert Pruning for Sparse MoEs via Schur-Complement Guided A* Search

Zheng Chen ⋅ Yang Weifeng ⋅ Jianxiao Tang ⋅ Buhui Yao

Abstract
