Skip to yearly menu bar Skip to main content


Poster

Natural Language Actor–Critic Is Bilevel: Learning to Reason with Textual Feedback

Utsav Singh ⋅ Sidhaarth Murali ⋅ Souradip Chakraborty ⋅ Amrit Singh Bedi

Abstract

Log in and register to view live content