Skip to yearly menu bar Skip to main content


Poster Tue, Jul 7, 2026 • 2:00 PM – 3:45 PM KST Coex: HALL A

$\textit{S}$-SPPO: Semantic-Calibrated Self-Play Preference Optimization

Xiwen Chen ⋅ Wenhui Zhu ⋅ Jingjing Wang ⋅ Peijie Qiu ⋅ Zhipeng Wang ⋅ Huayu Li ⋅ ZhengXiao He ⋅ XUANZHAO DONG ⋅ Prayag Tiwari ⋅ Mingkun Xu ⋅ Yujian Xiong ⋅ Feng Luo ⋅ Abolfazl Razi ⋅ Brendan Rappazzo ⋅ Anderson Schneider ⋅ Yuriy Nevmyvaka

Abstract

Log in and register to view live content