Skip to yearly menu bar Skip to main content


Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization

Chengcan Wu · Zhixin Zhang · Zeming Wei · Yihao Zhang · Meng Sun

Abstract

Chat is not available.