Skip to yearly menu bar Skip to main content


Poster

To Steer or Not to Steer? Mechanistic Error Reduction with Abstention for Language Models

Anna Hedström ⋅ Salim I. Amoukou ⋅ Tom Bewley ⋅ Saumitra Mishra ⋅ Manuela Veloso
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.