Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Workshop on Computer Use Agents

EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments

Sara Fish ⋅ Julia Shephard ⋅ Minkai Li ⋅ Ran Shorrer ⋅ Yannai A. Gonczarowski

Abstract

Chat is not available.