Invited Talk in Workshop: ES-FoMo II: 2nd Workshop on Efficient Systems for Foundation Models

Efficient Quantization Methods and Marlin, a Fast 4-Bit Inference Kernel

Elias Frantar


Abstract:
