Poster in Workshop: Next Generation of AI Safety

Robust Knowledge Unlearning via Mechanistic Localizations

Phillip Guo ⋅ Aaquib Syed ⋅ Abhay Sheshadri ⋅ Aidan Ewart ⋅ Gintare Karolina Dziugaite

Abstract
