Skip to yearly menu bar Skip to main content


Poster

LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models

guangyan li · Yongqiang Tang · Wensheng Zhang
2024 Poster

Abstract

Chat is not available.