Poster

Inference-Aware Meta-Alignment of LLMs via Non-Linear GRPO

Shokichi Takakura ⋅ Akifumi Wachi ⋅ Rei Higuchi ⋅ Kohei Miyaguchi ⋅ Taiji Suzuki

Abstract
