Skip to yearly menu bar Skip to main content


Principal-Driven Reward Design and Agent Policy Alignment via Bilevel-RL

Souradip Chakraborty ⋅ Amrit Bedi ⋅ Alec Koppel ⋅ Furong Huang ⋅ Mengdi Wang

Abstract

Video

Chat is not available.