Poster

ConServe: Fine-Grained GPU Harvesting for LLM Online and Offline Co-Serving

Yifan Qiao ⋅ Shan Yu ⋅ Shu Anzai ⋅ Haoran Ma ⋅ Shuo Yang ⋅ Yang Wang ⋅ Miryung Kim ⋅ Yongji Wu ⋅ Yang Zhou ⋅ Jiarong Xing ⋅ Joseph E Gonzalez ⋅ Ion Stoica ⋅ Harry Xu

Abstract
