Skip to yearly menu bar Skip to main content


Poster

Vegas: Self-Speculative Decoding with Verification-Guided Sparse Attention

Yikang Yue ⋅ Yuqi Xue ⋅ Jian Huang

Abstract

Log in and register to view live content