Skip to yearly menu bar Skip to main content


Interpretable Reward Modeling with Active Concept Bottlenecks

Sonia Laguna ⋅ Katarzyna Kobalczyk ⋅ Julia Vogt ⋅ Mihaela van der Schaar

Abstract

Chat is not available.