Skip to yearly menu bar Skip to main content


Poster

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Weian Mao ⋅ Xi Lin ⋅ Wei Huang ⋅ Yuxin Xie ⋅ Tianfu Fu ⋅ Bohan Zhuang ⋅ Song Han ⋅ Yukang Chen

Abstract

Log in and register to view live content