Poster Thu, Jul 9, 2026 • 1:00 AM – 2:45 AM PDT HALL A #4408

Beyond Benchmarks: Toward Causally Faithful Evaluation of Large Language Models

Zhengshuyuan Tian ⋅ Chuanxin Lan ⋅ Chenxi Wang ⋅ Lei Wang ⋅ Guoxin Kang ⋅ Zhengxin Yang ⋅ Yunyou Huang ⋅ Xuehai Hong ⋅ Wanling Gao ⋅ Jianfeng Zhan

Abstract
