

Poster Wed, Jul 8, 2026 • 6:30 PM – 8:15 PM PDT HALL A #108

Learning Useful Supervision for Reinforcement Learning in Reasoning Models

Liang CHEN ⋅ Xueting Han ⋅ Li Shen ⋅ Jing Bai ⋅ Kam-Fai Wong

Abstract
