Skip to yearly menu bar Skip to main content


Poster

IndexMem: Learned KV-Cache Eviction with Latent Memory for Long-Context LLM Inference

Xintong Yang ⋅ Hao Gu ⋅ Binxing Xu ⋅ Lujun Li ⋅ Bei Liu ⋅ Jiacheng Liu ⋅ Qiyuan Zhu ⋅ Sirui Han ⋅ Yike Guo

Abstract

Log in and register to view live content