Poster

Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models

Bo Gao ⋅ Michael Spratling ⋅ Letizia Gionfrida

Abstract
