Poster in Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)
Sat, Jul 19, 2025 • 3:00 PM – 3:45 PM PDT

Cascading Adversarial Bias from Injection to Distillation in Language Models

Harsh Chaudhari · Jamie Hayes · Matthew Jagielski · Ilia Shumailov · Milad Nasr · Alina Oprea

Abstract
