Skip to yearly menu bar Skip to main content


Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

Kaiyan Zhang ⋅ Jianyu Wang ⋅ Ning Ding ⋅ Biqing Qi ⋅ Ermo Hua ⋅ Xingtai Lv ⋅ Bowen Zhou

Abstract

Chat is not available.