Skip to yearly menu bar Skip to main content


Efficient Quantization Methods and Marlin, a Fast 4-Bit Inference Kernel

Elias Frantar

Video

Chat is not available.