Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Oral
Thu Jul 09 04:00 PM -- 04:15 PM (KST) None
$\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Victor Barres ⋅ Honghua Dong ⋅ Soham Ray ⋅ Xujie Si ⋅ Karthik Narasimhan
[ OpenReview
Oral
Thu Jul 09 04:15 PM -- 04:30 PM (KST) None
Characterizing Agents in Production
Melissa Pan ⋅ Negar Arabzadeh ⋅ Riccardo Cogo ⋅ Yuxuan Zhu ⋅ Alexander Xiong ⋅ Lakshya A Agrawal ⋅ Huanzhi Mao ⋅ Emma Shen ⋅ Sid Pallerla ⋅ Liana Patel ⋅ Shu Liu ⋅ Tianneng Shi ⋅ Xiaoyuan Liu ⋅ Jared Davis ⋅ Emmanuele Lacavalla ⋅ Alessandro Basile ⋅ Shuyi Yang ⋅ Paul Castro ⋅ Daniel Kang ⋅ Koushik Sen ⋅ Dawn Song ⋅ Joseph E Gonzalez ⋅ Ion Stoica ⋅ Matei Zaharia ⋅ Marquita Ellis
[ OpenReview
Oral
Thu Jul 09 04:30 PM -- 04:45 PM (KST) None
CVE-Factory: Scaling Expert-Level Agentic Tasks for Code Security Vulnerability
Xianzhen Luo ⋅ Jingyuan Zhang ⋅ Shiqi Zhou ⋅ JinYang Huang ⋅ Chuan Xiao ⋅ Qingfu Zhu ⋅ Zhiyuan Ma ⋅ YUE XING ⋅ Yang Yue ⋅ WencongZeng ⋅ Wanxiang Che
[ OpenReview
Oral
Thu Jul 09 04:45 PM -- 05:00 PM (KST) None
OMAC: A Holistic Optimization Framework for LLM-Based Multi-Agent Collaboration
Shijun Li ⋅ Hilaf Hasson ⋅ Joydeep Ghosh
[ OpenReview