Skip to yearly menu bar Skip to main content


Poster Thu, Jul 17, 2025 • 4:30 PM – 7:00 PM PDT

AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization

Junkang Wu · xue wang · Zhengyi Yang · Jiancan Wu · Jinyang Gao · Bolin Ding · Xiang Wang · Xiangnan He

Abstract

Lay Summary

Video

Chat is not available.