Skip to yearly menu bar Skip to main content


Speeding up Speculative Decoding via Sequential Approximate Verification

Meiyu Zhong · Noel Teku · Ravi Tandon

Abstract

Chat is not available.