Poster

LazyAttention: Efficient Retrieval-Augmented Generation with Deferred Positional Encoding

Haocheng Xia ⋅ Mihir Pamnani ⋅ Hanxi Fang ⋅ Supawit Chockchowwat ⋅ Yongjoo Park

Abstract
