Skip to yearly menu bar Skip to main content


Oral

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

Zichang Liu ⋅ Jue Wang ⋅ Tri Dao ⋅ Tianyi Zhou ⋅ Binhang Yuan ⋅ Zhao Song ⋅ Anshumali Shrivastava ⋅ Ce Zhang ⋅ Yuandong Tian ⋅ Christopher Re ⋅ Beidi Chen
2023 Oral
[ PDF

Abstract

Video

Chat is not available.