Skip to yearly menu bar Skip to main content


Poster

CaM: Cache Merging for Memory-efficient LLMs Inference

Yuxin Zhang ⋅ Yuxuan Du ⋅ Gen Luo ⋅ Yunshan Zhong ⋅ Zhenyu Zhang ⋅ Shiwei Liu ⋅ Rongrong Ji
2024 Poster

Abstract

Chat is not available.