Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding

Kaiyan Zhang · Jianyu Wang · Ning Ding · Biqing Qi · Ermo Hua · Xingtai Lv · Bowen Zhou

Abstract
