Skip to yearly menu bar Skip to main content


GapPO: Gradient-Adaptive Pairwise Preference Optimization

Michelle Chang ⋅ Xiaodi Sun ⋅ Ethan C Chau ⋅ Zhaoqiong Huang ⋅ Arpita Das ⋅ Izzie Lau ⋅ Liyuan Zheng ⋅ Huancheng Chen ⋅ Jingwen Lu

Abstract

Log in and register to view live content