Skip to yearly menu bar Skip to main content


Poster Thu, Jul 17, 2025 • 4:30 PM – 7:00 PM PDT

Demystifying Cost-Efficiency in LLM Serving over Heterogeneous GPUs

Youhe Jiang · Fangcheng Fu · Xiaozhe Yao · Guoliang HE · Xupeng Miao · Ana Klimovic · Bin Cui · Binhang Yuan · Eiko Yoneki

Abstract

Lay Summary

Video

Chat is not available.