Skip to yearly menu bar Skip to main content


Poster

EntroKV: Entropy-Guided Dynamic Budget Allocation for KV-Cache Compression

Wenhao Gao ⋅ Haoran Cao ⋅ Yueyan Li ⋅ YongGao Xiao ⋅ Caixia Yuan ⋅ Xiaojie Wang

Abstract

Log in and register to view live content