Skip to yearly menu bar Skip to main content


Poster

AdaHC: Accelerating Multi-Token Prediction with Adaptive Head Chunking with Pipeline Parallelism

Yan Wang ⋅ Chang Si ⋅ Kaiming Yang ⋅ Zhipeng Zhang ⋅ Weijian Liu ⋅ Man Yuan ⋅ Mingzhen Li ⋅ Yong Li ⋅ Weile Jia

Abstract

Log in and register to view live content