Spotlight

Staged Training for Transformer Language Models

Sheng Shen ⋅ Pete Walsh ⋅ Kurt Keutzer ⋅ Jesse Dodge ⋅ Matthew Peters ⋅ Iz Beltagy
2022 Spotlight
