

Poster

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Ziniu Li · Tian Xu · Yushun Zhang · Zhihang Lin · Yang Yu · Ruoyu Sun · Zhi-Quan Luo
2024 Poster

Abstract
