Skip to yearly menu bar Skip to main content


Poster
in
Affinity Event: The 6th Muslims in ML (MusIML) Workshop

Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning

Mujtaba farhan ⋅ Maheep Chaudhary ⋅ Sean Wu ⋅ Ashwinee Panda

Abstract

Log in and register to view live content