Skip to yearly menu bar Skip to main content


Omni-Think: Scaling Multi-Task Learning in LLMs via Reinforcement Learning

Derek Li ⋅ Jiaming Zhou ⋅ Amirreza Kazemi ⋅ Qianyi Sun ⋅ Abbas Ghaddar ⋅ Liheng Ma ⋅ Yu Luo ⋅ Dong Li ⋅ Jianye Hao ⋅ Yingxue Zhang

Abstract

Chat is not available.