Skip to yearly menu bar Skip to main content


Compressing Large Language Models to Any Size Without Re-Computation

Martin Genzel ⋅ Patrick Putzky ⋅ Pengfei Zhao ⋅ Sebastian Schulze ⋅ Mattes Mollenhauer ⋅ Robert Seidel ⋅ Stefan Dietzel ⋅ Thomas Wollmann

Abstract

Chat is not available.