

Poster

Improving Transformer Optimization Through Better Initialization

Xiao Shi Huang ⋅ Felipe Perez ⋅ Jimmy Ba ⋅ Maksims Volkovs
2020 Poster

