Poster Wed, Jul 8, 2026 • 6:30 PM – 8:15 PM PDT HALL A #1004

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Hongru Hou ⋅ Tiehua Mei ⋅ Denghui Geng ⋅ Jinhui Huang ⋅ Ao Xu ⋅ Hengrui Chen ⋅ Jiaqing Liang ⋅ Deqing Yang

Abstract
