Inverse Reinforcement Learning from Demonstrations for LLM Alignment

Hao Sun · M van der Schaar

Abstract
