Skip to yearly menu bar Skip to main content


Poster

Diagnosing the Reliability of LLM-as-a-Judge via Item Response Theory

Junhyuk Choi ⋅ Sohhyung Park ⋅ chanhee cho ⋅ Hyeonchu Park ⋅ Bugeun Kim

Abstract

Log in and register to view live content