Skip to yearly menu bar Skip to main content


Inverse Reinforcement Learning from Demonstrations for LLM Alignment

Hao Sun ⋅ M van der Schaar

Abstract

Video

Chat is not available.