Skip to yearly menu bar Skip to main content


Poster

Attn-QAT: 4-Bit Attention With Quantization-Aware Training

Peiyuan Zhang ⋅ Matthew Noto ⋅ Wenxuan Tan ⋅ Chengquan Jiang ⋅ Will Lin ⋅ Wei Zhou ⋅ Hao Zhang

Abstract

Log in and register to view live content