Skip to yearly menu bar Skip to main content


Poster

NanoSpec: Accelerating Speculative Decoding using Minimalist In-Context Vocabularies

Zhiyang Chen ⋅ Daliang Xu ⋅ Yinyuan Zhang ⋅ Chenghua Wang ⋅ Mengwei Xu ⋅ Yun Ma

Abstract

Log in and register to view live content