

Poster Mon, Jul 6, 2026 • 10:00 PM – 11:45 PM PDT HALL A #411

Natural Language Actor–Critic Is Bilevel: Learning to Reason with Textual Feedback

Utsav Singh ⋅ Sidhaarth Murali ⋅ Souradip Chakraborty ⋅ Amrit Singh Bedi

Abstract
