Poster

The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

Yifan Sun ⋅ Han Wang ⋅ Dongbai Li ⋅ Gang Wang ⋅ Huan Zhang
2025 Poster

