Skip to yearly menu bar Skip to main content


Invited Talk
in
Workshop: ES-FoMo II: 2nd Workshop on Efficient Systems for Foundation Models

Efficient Quantization Methods and Marlin, a Fast 4-Bit Inference Kernel

Elias Frantar

Fri 26 Jul 12:01 a.m. PDT — 12:30 a.m. PDT

Abstract:

Chat is not available.