Skip to yearly menu bar Skip to main content


Poster

Prioritize the Process, Not Just the Outcome: Rewarding Latent Thought Trajectories Improves Reasoning in Looped Language Models

Jonathan Williams ⋅ Esin Tureci ⋅ Olga Russakovsky

Abstract

Log in and register to view live content