Poster

HybridFlow: Resource-Adaptive Subtask Routing for Efficient Edge-Cloud LLM Inference

Jiangwen Dong ⋅ Jiayu Li ⋅ Tianhang Zheng ⋅ Wanyu LIN

Abstract
