Poster

Coupled Variational Reinforcement Learning for Language Model General Reasoning

Xueru Wen ⋅ Jie Lou ⋅ Yanjiang Liu ⋅ Hongyu Lin ⋅ Ben He ⋅ Xianpei Han ⋅ Le Sun ⋅ Yaojie Lu ⋅ Debing Zhang

Abstract
