Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

Yinghui He · Yufan Wu · Yulong Chen · Naihao Deng

Abstract
