Skip to yearly menu bar Skip to main content


Poster

AlphaPO: Reward Shape Matters for LLM Alignment

Aman Gupta ⋅ Shao Tang ⋅ Qingquan Song ⋅ Sirou Zhu ⋅ Jiwoo Hong ⋅ Ankan Saha ⋅ Viral Gupta ⋅ Noah Lee ⋅ Eunki Kim ⋅ Siyu Zhu ⋅ Parag Agrawal ⋅ Natesh Pillai ⋅ Sathiya Keerthi
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.