Poster

Efficient Bilevel Optimization for CKA-Guided MoE Upcycling

Zhiyuan Yu ⋅ Enneng Yang ⋅ Hao Jiang ⋅ Guojie Zhu ⋅ Feihong He ⋅ Peng Wang ⋅ Li Shen

Abstract
