Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Roberto Castro · Andrei Panferov · Rush Tabesh · Jiale Chen · Oliver Sieberling · Mahdi Nikdan · Saleh Ashkboos · Dan Alistarh

Abstract
