

Poster in Workshop: RLxF: RL from World Feedback

Training AI Co-Scientists using Rubric Rewards

Shashwat Goel ⋅ Rishi Hazra ⋅ Dulhan Jayalath ⋅ Timon Willi ⋅ Parag Jain ⋅ Shen ⋅ Ilias Leontiadis ⋅ Francesco Barbieri ⋅ Yoram Bachrach ⋅ Jonas Geiping ⋅ Chenxi Whitehouse

Abstract
