Invited Talk
in
Workshop: ES-FoMo II: 2nd Workshop on Efficient Systems for Foundation Models
Efficient Quantization Methods and Marlin, a Fast 4-Bit Inference Kernel
Elias Frantar
Fri 26 Jul 12:01 a.m. PDT
— 12:30 a.m. PDT
Abstract:
Chat is not available.