Skip to yearly menu bar Skip to main content


Poster

R-Diverse: Mitigating Diversity Illusion in Self-Play LLM Training

Gengsheng Li ⋅ Jinghan He ⋅ Shijie Wang ⋅ Dan Zhang ⋅ Ruiqi Liu ⋅ Renrui Zhang ⋅ Zijun Yao ⋅ Junfeng Fang ⋅ Haiyun Guo ⋅ Jinqiao Wang

Abstract

Log in and register to view live content