Poster

Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?

Khashayar Gatmiry ⋅ Nikunj Saunshi ⋅ Sashank J. Reddi ⋅ Stefanie Jegelka ⋅ Sanjiv Kumar
2024 Poster

Abstract
