Skip to yearly menu bar Skip to main content


Poster

AutoQRA: Joint Optimization of Mixed-Precision Quantization and Low-rank Adapters for Efficient LLM Fine-Tuning

Changhai Zhou ⋅ Shiyang Zhang ⋅ Yuhua Zhou ⋅ Qian Qiao ⋅ Jun Gao ⋅ Cheng Jin ⋅ KAIZHOU QIN ⋅ Weizhong Zhang

Abstract

Log in and register to view live content