Skip to yearly menu bar Skip to main content


Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

Yinghui He ⋅ Yufan Wu ⋅ Yulong Chen ⋅ Naihao Deng

Abstract

Video

Chat is not available.