Skip to yearly menu bar Skip to main content


Poster

Exploring the Benefit of Activation Sparsity in Pre-training

Zhengyan Zhang ⋅ Chaojun Xiao ⋅ Qiujieli Qin ⋅ Yankai Lin ⋅ Zhiyuan Zeng ⋅ Xu Han ⋅ Zhiyuan Liu ⋅ Ruobing Xie ⋅ Maosong Sun ⋅ Jie Zhou
2024 Poster

Abstract

Chat is not available.