Skip to yearly menu bar Skip to main content


When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective

Alireza Mousavi-Hosseini ⋅ Clayton Sanford ⋅ Denny Wu ⋅ Murat Erdogdu

Abstract

Chat is not available.