Skip to yearly menu bar Skip to main content


Oral

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

D. Sculley ⋅ William Cukierski ⋅ Phil Culliton ⋅ Sohier Dane ⋅ Maggie Demkin ⋅ Ryan Holbrook ⋅ Addison Howard ⋅ Paul Mooney ⋅ Walter Reade ⋅ Meg Risdal ⋅ Nate Keating
2025 Oral

Abstract

Lay Summary

Video

Chat is not available.