Skip to yearly menu bar Skip to main content


Poster

G$^2$RPO: Geometric GRPO; Escaping LLM's Reasoning Rut to Break Accuracy--Entropy Trade-off

Ali Rad ⋅ Khashayar Filom ⋅ Darioush Keivan ⋅ Peyman Mohajerin Esfahani ⋅ Ehsan Kamalinejad

Abstract

Log in and register to view live content