Skip to yearly menu bar Skip to main content


Poster

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation

Danny Halawi · Alexander Wei · Eric Wallace · Tony Wang · Nika Haghtalab · Jacob Steinhardt
2024 Poster

Abstract

Chat is not available.