Skip to yearly menu bar Skip to main content


Oral Wed, Jul 16, 2025 • 3:45 PM – 4:00 PM PDT

Position: Medical Large Language Model Benchmarks Should Prioritize Construct Validity

Ahmed Alaa · Thomas Hartvigsen · Niloufar Golchini · Shiladitya Dutta · Frances Dean · Inioluwa Raji · Travis Zack

Abstract

Lay Summary

Video

Chat is not available.