Skip to yearly menu bar Skip to main content


Oral

Transformers Learn Nonlinear Features In Context: Nonconvex Mean-field Dynamics on the Attention Landscape

Juno Kim · Taiji Suzuki
2024 Oral

Abstract

Video

Chat is not available.