Skip to yearly menu bar Skip to main content


Poster

SPARD: Defending Harmful Fine-Tuning Attack via Safety Projection with Relevance–Diversity Data Selection

Shuhao Chen ⋅ Weisen Jiang ⋅ Yeqi Gong ⋅ Shengda Luo ⋅ Chengxiang Zhuo ⋅ Zang Li ⋅ James Kwok ⋅ Yu Zhang

Abstract

Log in and register to view live content