

Poster in Workshop: AI for Math Workshop

Teaching Large Language Models to Reason with Reinforcement Learning

Alexander Havrilla ⋅ Yuqing Du ⋅ Sharath Chandra Raparthy ⋅ Christoforos Nalmpantis ⋅ Jane Dwivedi-Yu ⋅ Eric Hambro ⋅ Sainbayar Sukhbaatar ⋅ Roberta Raileanu

Abstract
