

Poster Wed, Jul 16, 2025 • 11:00 AM – 1:30 PM PDT

Position: AI Evaluation Should Learn from How We Test Humans

Yan Zhuang · Qi Liu · Zachary Pardos · Patrick Kyllonen · Jiyun Zu · Zhenya Huang · Shijin Wang · Enhong Chen
