Oral in Workshop: DIG-BUGS: Data in Generative Models (The Bad, the Ugly, and the Greats)
Sat, Jul 19, 2025 • 9:45 AM – 10:00 AM PDT

Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets

Lei Hsiung · Tianyu Pang · Yung-Chen Tang · Linyue Song · Tsung-Yi Ho · Pin-Yu Chen · Yaoqing Yang
