Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers

Siyu Chen ⋅ Heejune Sheen ⋅ Tianhao Wang ⋅ Zhuoran Yang

Abstract
