Skip to yearly menu bar Skip to main content


Poster

$\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Victor Barres ⋅ Honghua Dong ⋅ Soham Ray ⋅ Xujie Si ⋅ Karthik Narasimhan

Abstract

Log in and register to view live content