Skip to yearly menu bar Skip to main content


Poster

HARD-KV: Head-Adaptive Regularization for Decoding-time KV Compression

Yuxuan Yang ⋅ Feiyang Ren ⋅ Bowen Zeng ⋅ Dalin Zhang ⋅ Jinpeng Chen ⋅ Gang Chen ⋅ Huan Li

Abstract

Log in and register to view live content