

Poster

HexGen-3: A Fully Disaggregated LLM Serving Framework with Fine-Grained Heterogeneous Resource Autoscaling

Youhe Jiang ⋅ Wenshuang Li ⋅ You Peng ⋅ Jintao Zhang ⋅ Ran Yan ⋅ Jianfei Chen ⋅ Xu Han ⋅ Fangcheng Fu ⋅ Binhang Yuan

Abstract
