Skip to yearly menu bar Skip to main content


Efficient Training of Language Models with Compact and Consistent Next Token Distributions

Ashutosh Sathe · Sunita Sarawagi

Abstract

Chat is not available.