Oral

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies

Nadav Timor ⋅ Jonathan Mamou ⋅ Daniel Korat ⋅ Moshe Berchansky ⋅ Gaurav Jain ⋅ Oren Pereg ⋅ Moshe Wasserblat ⋅ David Harel
2025 Oral

