Skip to yearly menu bar Skip to main content


TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Zhangchen Xu ⋅ Yuetai Li ⋅ Fengqing Jiang ⋅ Bhaskar Ramasubramanian ⋅ Luyao Niu ⋅ Yuchen Lin ⋅ Radha Poovendran

Abstract

Chat is not available.