Poster

PatternKV: Flattening KV Representation Expands Quantization Headroom

Ji Zhang ⋅ Yiwei Li ⋅ Shaoxiong Feng ⋅ Peiwen Yuan ⋅ Xinglin Wang ⋅ Yueqi Zhang ⋅ Jiayi Shi ⋅ Chuyi Tan ⋅ Boyuan Pan ⋅ Yao Hu ⋅ Kan Li

Abstract
