Skip to yearly menu bar Skip to main content


Poster

Preserve-Then-Quantize: Balancing Rank Budgets for Quantization Error Reconstruction in LLMs

Yoonjun Cho ⋅ Dongjae Jeon ⋅ Soeun Kim ⋅ Moongyu Jeon ⋅ Albert No

Abstract

Log in and register to view live content