Poster

Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction

Mathieu Blondel ⋅ Michael Sander ⋅ Germain Vivier-Ardisson ⋅ Tianlin Liu ⋅ Vincent Roulet

Abstract
