Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

Robust Knowledge Unlearning via Mechanistic Localizations

Phillip Guo · Aaquib Syed · Abhay Sheshadri · Aidan Ewart · Gintare Karolina Dziugaite

Abstract

Chat is not available.