Skip to yearly menu bar Skip to main content


Poster Wed, Jul 8, 2026 • 2:30 PM – 4:15 PM KST Coex: HALL A

CoRe: Combined Rewards with Vision-Language Model Feedback for Preference-Aligned Reinforcement Learning

Hexian Ni ⋅ Tao Lu ⋅ Yinghao Cai

Abstract

Log in and register to view live content