Skip to yearly menu bar Skip to main content


The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Xinyu Zhu · Mengzhou Xia · Zhepei Wei · Wei-Lin Chen · Danqi Chen · Yu Meng

Abstract

Chat is not available.