Speeding up Speculative Decoding via Sequential Approximate Verification

Meiyu Zhong ⋅ Noel Teku ⋅ Ravi Tandon

Abstract
