Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Pluralistic Alignment Workshop

Innocuous-Seeming Data, Latent Ideology: Ideological Generalisation in Finetuned LLMs

Robert Graham ⋅ Edward Stevinson ⋅ Yariv Barsheshat

Abstract

Log in and register to view live content