

Poster

Principled Reinforcement Learning with Human Feedback from Pairwise or K-wise Comparisons

Banghua Zhu · Michael Jordan · Jiantao Jiao
2023 Poster

