Skip to yearly menu bar Skip to main content


Poster

Accelerating Iterative Retrieval-augmented Language Model Serving with Speculation

Zhihao Zhang · Alan Zhu · Lijie Yang · Yihua Xu · Lanting Li · Phitchaya Phothilimthana · Zhihao Jia
2024 Poster

Abstract

Chat is not available.