Selective Perturbations as a Diagnostic for Benchmark-Based LLM Comparisons

Ivan Dubrovsky ⋅ Anastasia Orlova ⋅ Nina Gubina ⋅ Illarion Iov ⋅ Irena Gureeva ⋅ Nikolay Nikitin ⋅ Alexey Zaytsev

Abstract
