Skip to yearly menu bar Skip to main content


Poster Thu, Jul 9, 2026 • 1:00 AM – 2:45 AM PDT HALL A #2010

FAFO: Lossy KV Cache Compression for Lossless Inference Acceleration via Draftless Fumble Decoding

Hoang Anh Duy Le ⋅ Shaochen (Henry) Zhong ⋅ Yifan Lu ⋅ Yingtong Dou ⋅ Jiayi Yuan ⋅ Yu-Neng Chuang ⋅ Xiran Fan ⋅ Guanchu Wang ⋅ Yuzhong Chen ⋅ Xia Hu

Abstract

Log in and register to view live content