Beyond Accuracy: A Policy Gradient Reweighting Approach for Pass@K Maximization in LLMs

Sadegh Mahdavi ⋅ Muchen Li ⋅ Kaiwen Liu ⋅ Renjie Liao ⋅ Christos Thrampoulidis

Abstract