Skip to yearly menu bar Skip to main content


Poster

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation

Danny Halawi ⋅ Alexander Wei ⋅ Eric Wallace ⋅ Tony Wang ⋅ Nika Haghtalab ⋅ Jacob Steinhardt
2024 Poster

Abstract

Chat is not available.