Poster Tue, Jul 7, 2026 • 10:30 AM – 12:15 PM KST Coex: HALL A

Scheduling LLM Inference with Uncertainty-Aware Output Length Predictions

Haoyu Zheng ⋅ Yongqiang Zhang ⋅ Fangcheng Fu ⋅ Xiaokai Zhou ⋅ Hao Luo ⋅ Hongchao Zhu ⋅ Yuanyuan Zhu ⋅ Hao Wang ⋅ Xiao Yan ⋅ Jiawei Jiang

Abstract
