SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors

Tiancheng Hu ⋅ Joachim Baumann ⋅ Lorenzo Lupo ⋅ Nigel Collier ⋅ Dirk Hovy ⋅ Paul Röttger

Abstract
