Skip to yearly menu bar Skip to main content


Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Roberto Castro ⋅ Andrei Panferov ⋅ Rush Tabesh ⋅ Jiale Chen ⋅ Oliver Sieberling ⋅ Mahdi Nikdan ⋅ Saleh Ashkboos ⋅ Dan Alistarh

Abstract

Chat is not available.