Skip to yearly menu bar Skip to main content


Hydragen: High-Throughput LLM Inference with Shared Prefixes

Jordan Juravsky ⋅ Bradley Brown ⋅ Ryan Ehrlich ⋅ Daniel Y Fu ⋅ Christopher Re ⋅ Azalia Mirhoseini

Abstract

Chat is not available.