Skip to yearly menu bar Skip to main content


Few-shot Steerable Alignment: Adapting Rewards and LLM Policies with Neural Processes

Katarzyna Kobalczyk ⋅ Claudio Fanconi ⋅ Hao Sun ⋅ Mihaela van der Schaar

Abstract

Video

Chat is not available.