Poster

TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios

Yuanzhe Shen ⋅ Zisu Huang ⋅ Zhengyuan Wang ⋅ Muzhao Tian ⋅ Zhengkang Guo ⋅ Chenyang Zhang ⋅ Shuaiyu Zhou ⋅ Zengjie Hu ⋅ Dailin Li ⋅ Kaimin Wang ⋅ Wenhao Liu ⋅ Tianlong Li ⋅ feng hong ⋅ Cao Liu ⋅ Ke Zeng

Abstract
