Skip to yearly menu bar Skip to main content


Interpretable Reward Modeling with Active Concept Bottlenecks

Sonia Laguna · Katarzyna Kobalczyk · Julia Vogt · Mihaela van der Schaar

Abstract

Chat is not available.