Skip to yearly menu bar Skip to main content


Poster Tue, Jul 7, 2026 • 10:30 PM – 12:15 AM PDT HALL A #2314

AdaHC: Accelerating Multi-Token Prediction with Adaptive Head Chunking with Pipeline Parallelism

Yan Wang ⋅ Chang Si ⋅ Kaiming Yang ⋅ Zhipeng Zhang ⋅ Weijian Liu ⋅ Man Yuan ⋅ Mingzhen Li ⋅ Yong Li ⋅ Weile Jia

Abstract

Log in and register to view live content