Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

Workflow-R1: Group Sub-sequence Policy Optimization for Multi-turn Workflow Construction

Mingze Kong ⋅ Zikun Qu ⋅ Zhongquan Zhou ⋅ Pengyu Liang ⋅ Xiang Li ⋅ Zhiwei Shang ⋅ Zhi Hong ⋅ Kaiyu Huang ⋅ Zhiyong Wang ⋅ Zhongxiang Dai

Abstract

Log in and register to view live content