Skip to yearly menu bar Skip to main content


Compressing Large Language Models to Any Size Without Re-Computation

Martin Genzel · Patrick Putzky · Pengfei Zhao · Sebastian Schulze · Mattes Mollenhauer · Robert Seidel · Stefan Dietzel · Thomas Wollmann

Abstract

Chat is not available.