Skip to yearly menu bar Skip to main content


Oral

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

Zichang Liu · Jue Wang · Tri Dao · Tianyi Zhou · Binhang Yuan · Zhao Song · Anshumali Shrivastava · Ce Zhang · Yuandong Tian · Christopher Re · Beidi Chen
2023 Oral
[ PDF

Abstract

Video

Chat is not available.