Poster Thu, Jul 9, 2026 • 10:30 AM – 12:15 PM KST HALL A

OnePO: Direct One-stage Policy Optimization for SFT-free Domain Adaptation

Junying Chen ⋅ Xinyuan Xie ⋅ Ziniu Li ⋅ Benyou Wang

Abstract
