Skip to yearly menu bar Skip to main content


Poster

LC-QAT: Data-Efficient 2-Bit QAT for LLMs via Linear-Constrained Vector Quantization

Xingyu Yu ⋅ Haoyu Wang ⋅ Haiyan Zhao ⋅ Fengxiang Wang ⋅ Xu Han

Abstract

Log in and register to view live content