Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Pluralistic Alignment Workshop

Side Effects of Character Training: Quantifying Cross-Constitution Drift in LLMs

Bhagyesh Kumar ⋅ Ananya Sutradhar ⋅ Saurav Panigrahi ⋅ Jonathn Chang ⋅ Lionel Levine

Abstract

Log in and register to view live content