Skip to yearly menu bar Skip to main content


Poster

Sparser Block-Sparse Attention via Token Permutation

Xinghao Wang ⋅ Pengyu Wang ⋅ Dong Zhang ⋅ Chenkun Tan ⋅ Shaojun Zhou ⋅ Zhaoxiang Liu ⋅ Shiguo Lian ⋅ Fangxu Liu ⋅ Kai Song ⋅ Xipeng Qiu

Abstract

Log in and register to view live content