Poster

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Yichao Fu · Peter Bailis · Ion Stoica · Hao Zhang
2024 Poster

Abstract
