Skip to yearly menu bar Skip to main content


Poster

Outcome-Based Rewards Do Not Guarantee Faithful and Verifiable Reasoning

Qinan Yu ⋅ Alexa Tartaglini ⋅ Peter Hase ⋅ Carlos Guestrin ⋅ Christopher Potts

Abstract

Log in and register to view live content