Skip to yearly menu bar Skip to main content


Poster

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Guangxuan Xiao ⋅ Ji Lin ⋅ Mickael Seznec ⋅ Hao Wu ⋅ Julien Demouth ⋅ Song Han
2023 Poster

Abstract

Video

Chat is not available.