Skip to yearly menu bar Skip to main content


Poster

xKV: Cross-Layer KV-Cache Compression via Aligned Singular Vector Extraction

Chi-Chih Chang ⋅ Wei-Cheng Lin ⋅ Chien-Yu Lin ⋅ Hung-Yueh Chiang ⋅ Yash Akhauri ⋅ Xilai Dai ⋅ Huiqiang Jiang ⋅ Yucheng Li ⋅ Kai-Chiang Wu ⋅ Luis Ceze ⋅ Mohamed Abdelfattah

Abstract

Log in and register to view live content