VOCABTRIM: Vocabulary Pruning for Efficient Speculative Decoding in LLMs

Raghavv Goel ⋅ Sudhanshu Agrawal ⋅ Mukul Gagrani ⋅ Junyoung Park ⋅ Yifan Zao ⋅ He Zhang ⋅ Tian Liu ⋅ Yiping Yang ⋅ Xin Yuan ⋅ Jiuyuan Lu ⋅ Christopher Lott ⋅ Mingu Lee

Abstract
