Poster

Staged Training for Transformer Language Models

Sheng Shen ⋅ Pete Walsh ⋅ Kurt Keutzer ⋅ Jesse Dodge ⋅ Matthew Peters ⋅ Iz Beltagy
2022 Poster

Abstract
