TriLM vs FloatLM: Ternary LLMs are more Performant than Quantized FP16 LLMs

Ayush Kaushal ⋅ Tejas Vaidhya ⋅ Tejas Pandey ⋅ Aaryan Bhagat ⋅ Irina Rish

Abstract
