Skip to yearly menu bar Skip to main content


Poster

UltraHorizon: Benchmarking LLM-Agent Capabilities in Ultra Long-Horizon Scenarios

Haotian Luo ⋅ Huaisong Zhang ⋅ Xuelin Zhang ⋅ Haoyu Wang ⋅ Zeyu Qin ⋅ Wenjie Lu ⋅ Guozheng Ma ⋅ Haiying He ⋅ Yingsha Xie ⋅ Qiyang Zhou ⋅ Zixuan Hu ⋅ Hongze Mi ⋅ Yibo Wang ⋅ Naiqiang Tan ⋅ Hong Chen ⋅ Yi Fung ⋅ Chun Yuan ⋅ Li Shen

Abstract

Log in and register to view live content