Skip to yearly menu bar Skip to main content


Poster

CAT-Q: Cost-efficient and Accurate Ternary Quantization for LLMs

Shigeng Wang ⋅ Chao Li ⋅ Yangyuxuan Kang ⋅ Jiawei Fan ⋅ Anbang Yao

Abstract

Log in and register to view live content