Skip to yearly menu bar Skip to main content


Beyond Accuracy: A Policy Gradient Reweighting Approach for Pass@K Maximization in LLMs

Sadegh Mahdavi · Muchen Li · Kaiwen Liu · Renjie Liao · Christos Thrampoulidis

Abstract

Chat is not available.