Skip to yearly menu bar Skip to main content


Poster

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Jaylen Jones ⋅ Zhehao Zhang ⋅ Yuting Ning ⋅ Eric Fosler-Lussier ⋅ Pierre-Luc St-Charles ⋅ Yoshua Bengio ⋅ Dawn Song ⋅ Yu Su ⋅ Huan Sun

Abstract

Log in and register to view live content