Skip to yearly menu bar Skip to main content


Poster

Transforming and Combining Rewards for Aligning Large Language Models

Zihao Wang · Chirag Nagpal · Jonathan Berant · Jacob Eisenstein · Alexander D'Amour · Sanmi Koyejo · Victor Veitch
2024 Poster

Abstract

Chat is not available.