

Poster

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Tong Wu ⋅ Michael Liu ⋅ Jun Bai ⋅ Zixia Jia ⋅ Shuyi Zhang ⋅ Ziyong Lin ⋅ Yanting Wang ⋅ Song-Chun Zhu ⋅ Zilong Zheng

Abstract
