Skip to yearly menu bar Skip to main content


Poster

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Chao Wang ⋅ Bei Li ⋅ Jiaqi Zhang ⋅ Xinyu Liu ⋅ Yuchun Fan ⋅ Linkun Lyu ⋅ Xin Chen ⋅ Jingang Wang ⋅ Tong Xiao ⋅ Peng Pei ⋅ Xunliang Cai

Abstract

Log in and register to view live content