Skip to yearly menu bar Skip to main content


Poster

LoKiFormer: Locality-aware Attention with Decoupled Knowledge Memory for Efficient Large Language Model Pretraining

Zimo Liu ⋅ Qiuwu Chen ⋅ Yuchen Li ⋅ Ying Sun ⋅ Yifan Zhang ⋅ Zhijie Qiu ⋅ Zeng You ⋅ Ryan Dong ⋅ Simeng Ma ⋅ Yaofo Chen ⋅ Mingkui Tan

Abstract

Log in and register to view live content