Skip to yearly menu bar Skip to main content


Poster

SciAgentGym: Benchmarking Multi-Step Scientific Tool-Use in LLM Agents

Yujiong Shen ⋅ Yajie Yang ⋅ Zhiheng Xi ⋅ Binze Hu ⋅ Huayu Sha ⋅ Qiyuan Peng ⋅ Jiazheng Zhang ⋅ Junlin Shang ⋅ Jixuan Huang ⋅ Yutao Fan ⋅ Jingqi Tong ⋅ Shihan Dou ⋅ Ming Zhang ⋅ LEI BAI ⋅ Zhenfei Yin ⋅ Tao Gui ⋅ Xingjun Ma ⋅ Qi Zhang ⋅ Xuanjing Huang ⋅ Yu-Gang Jiang

Abstract

Log in and register to view live content