Skip to yearly menu bar Skip to main content


Poster

Group Distributionally Robust Optimization-Driven RL for LLM Reasoning

Kishan Panaganti ⋅ Zhenwen Liang ⋅ Wenhao Yu ⋅ Haitao Mi ⋅ Dong Yu

Abstract

Log in and register to view live content