Skip to yearly menu bar Skip to main content


Poster

At the Edge of Understanding: Sparse Autoencoders Trace The Limits of Transformer Generalization

Praneet Suresh ⋅ Jack Stanley ⋅ Sonia Joseph ⋅ Luca Scimeca ⋅ Danilo Bzdok

Abstract

Log in and register to view live content