Toggle Poster Visibility
Oral
Wed Jul 16 03:30 PM -- 03:45 PM (PDT) @ West Ballroom A None
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation
[
OpenReview]
Oral
Wed Jul 16 03:45 PM -- 04:00 PM (PDT) @ West Ballroom A None
Position: Medical Large Language Model Benchmarks Should Prioritize Construct Validity
[
OpenReview]
Oral
Wed Jul 16 04:00 PM -- 04:15 PM (PDT) @ West Ballroom A None
Position: Principles of Animal Cognition to Improve LLM Evaluations
[
OpenReview]
Oral
Wed Jul 16 04:15 PM -- 04:30 PM (PDT) @ West Ballroom A None
Position: Political Neutrality in AI Is Impossible — But Here Is How to Approximate It
[
OpenReview]