Skip to yearly menu bar Skip to main content


Poster

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Zecheng Tang ⋅ Quantong Qiu ⋅ Yi Yang ⋅ Zhiyi Hong ⋅ Haiya Xiang ⋅ Kebin Liu ⋅ Qingqing Dang ⋅ Juntao Li ⋅ Min zhang

Abstract

Log in and register to view live content