Daniel Brown: Pitfalls and paths forward when learning rewards from human feedback


