Skip to yearly menu bar Skip to main content


Spotlight Poster Tue, Jul 15, 2025 • 4:30 PM – 7:00 PM PDT

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies

Nadav Timor · Jonathan Mamou · Daniel Korat · Moshe Berchansky · Gaurav Jain · Oren Pereg · Moshe Wasserblat · David Harel

Abstract

Lay Summary

Video

Chat is not available.