

Poster Thu, Jul 9, 2026 • 1:00 AM – 2:45 AM PDT HALL A #2103

Coupled Variational Reinforcement Learning for Language Model General Reasoning

Xueru Wen ⋅ Jie Lou ⋅ Yanjiang Liu ⋅ Hongyu Lin ⋅ Ben He ⋅ Xianpei Han ⋅ Le Sun ⋅ Yaojie Lu ⋅ Debing Zhang

Abstract
