

Poster

NanoQuant: Efficient Sub-1-bit Quantization of Large Language Models

Hyochan Chong ⋅ Dongkyu Kim ⋅ Changdong Kim ⋅ Minseop Choi

Abstract
