Skip to yearly menu bar Skip to main content


Too Big to Think: Capacity, Memorization, and Generalization in Pre-Trained Transformers

Joshua Barron · Devin White

Abstract

Video

Chat is not available.