Skip to yearly menu bar Skip to main content


Poster Thu, Jul 9, 2026 • 10:30 AM – 12:15 PM KST Coex: HALL A

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Hongru Hou ⋅ Tiehua Mei ⋅ Denghui Geng ⋅ Jinhui Huang ⋅ Ao Xu ⋅ Hengrui Chen ⋅ Jiaqing Liang ⋅ Deqing Yang

Abstract

Log in and register to view live content