Skip to yearly menu bar Skip to main content


Why do universal adversarial attacks work on large language models?: Geometry might be the answer

Varshini Subhash · Anna Bialas · Siddharth Swaroop · Weiwei Pan · Finale Doshi-Velez

Abstract

Chat is not available.