Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Workshop on Computer Use Agents

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents

Ido Levy ⋅ Ben wiesel ⋅ Sami Marreed ⋅ Alon Oved ⋅ Avi Yaeli ⋅ Segev Shlomov

Abstract

Chat is not available.