TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models

Zhuohan Li · Siyuan Zhuang · Shiyuan Guo · Danyang Zhuo · Hao Zhang · Dawn Song · Ion Stoica
2021 Spotlight

Abstract
