Skip to yearly menu bar Skip to main content


Poster

Oracle-MoE: Locality-preserving Routing in the Oracle Space for Memory-constrained Large Language Model Inference

Jixian Zhou ⋅ Fang DONG(董方) ⋅ Ruijun Huang ⋅ Hengjie Cao ⋅ Mengyi Chen ⋅ Yifeng Yang ⋅ Anrui Chen ⋅ Mingzhi Dong ⋅ Yujiang Wang ⋅ Dongsheng Li ⋅ David Clifton ⋅ Qin Lv ⋅ Rui Zhu ⋅ Chun Zhang ⋅ Fan Yang ⋅ Tun Lu ⋅ Ning Gu ⋅ Li Shang
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.