Skip to yearly menu bar Skip to main content


The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Xinyu Zhu ⋅ Mengzhou Xia ⋅ Zhepei Wei ⋅ Wei-Lin Chen ⋅ Danqi Chen ⋅ Yu Meng

Abstract

Chat is not available.