Skip to yearly menu bar Skip to main content


Poster Tue, Jul 15, 2025 • 11:00 AM – 1:30 PM PDT

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Mingkang Zhu · Xi Chen · Zhongdao Wang · Bei Yu · Hengshuang Zhao · Jiaya Jia

Abstract

Lay Summary

Video

Chat is not available.