

Poster

SWE-ABS: Adversarial Benchmark Strengthening Exposes Inflated Success Rates on Test-based Benchmark

Boxi Yu ⋅ Yang Cao ⋅ Yuzhong Zhang ⋅ Liting Lin ⋅ Junjielong Xu ⋅ Zhiqing Zhong ⋅ Qinghua Xu ⋅ Guancheng Wang ⋅ Jialun Cao ⋅ Shing-Chi Cheung ⋅ Pinjia He ⋅ Lionel Briand

Abstract
