Poster

Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning?

Khashayar Gatmiry · Nikunj Saunshi · Sashank J. Reddi · Stefanie Jegelka · Sanjiv Kumar
2024 Poster

Abstract
