Poster

Beyond Euclidean Clipping: Overcoming Exploration Collapse in LLM RL via Riemannian Isometric Policy Optimization

Zhicheng Cai ⋅ Xinyuan Guo ⋅ Hanlin Wu ⋅ Mingxuan Wang ⋅ Wei-Ying Ma ⋅ Ya-Qin Zhang ⋅ Hao Zhou

Abstract
