

Poster

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Shenao Zhang ⋅ Zhihan Liu ⋅ Boyi Liu ⋅ Yufeng Zhang ⋅ Yingxiang Yang ⋅ Yongfei Liu ⋅ Liyu Chen ⋅ Tao Sun ⋅ Zhaoran Wang
2025 Poster

