Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
The 2026 schedule is still incomplete
Toggle Poster Visibility
Oral
Tue Jul 07 10:00 AM -- 10:15 AM (KST) None
Benchmarking at the Edge of Comprehension
Samuele Marro ⋅ Jialin Yu ⋅ Emanuele La Malfa ⋅ Oishi Deb ⋅ Jiawei Li ⋅ Yibo Yang ⋅ Ebey Abraham ⋅ Sunando Sengupta ⋅ Eric Sommerlade ⋅ Michael Wooldridge ⋅ Phil Torr
[ OpenReview
Oral
Tue Jul 07 10:15 AM -- 10:30 AM (KST) None
daVinci-Dev: Agent-native Mid-training for Software Engineering
Ji Zeng ⋅ Dayuan Fu ⋅ Tiantian Mi ⋅ Zhuang Yumin ⋅ Yaxing Huang ⋅ Xuefeng Li ⋅ Lyumanshan Ye ⋅ Muhang Xie ⋅ Qishuo Hua ⋅ Zhen Huang ⋅ Mohan Jiang ⋅ Hanning Wang ⋅ Shijie Xia ⋅ Yang Xiao ⋅ Jie Sun ⋅ Yunze Wu ⋅ Pengfei Liu
[ OpenReview
Oral
Tue Jul 07 10:30 AM -- 10:45 AM (KST) None
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Lukasz Borchmann ⋅ Jordy Van Landeghem ⋅ Michał Turski ⋅ Shreyansh Padarha ⋅ Ryan Kearns ⋅ Adam Mahdi ⋅ Niels Rogge ⋅ Clémentine Fourrier ⋅ Siwei Han ⋅ Huaxiu Yao ⋅ Artemis Llabrés ⋅ Yiming Xu ⋅ Dimosthenis Karatzas ⋅ Hao Zhang ⋅ Anupam Datta
[ OpenReview
Oral
Tue Jul 07 10:45 AM -- 11:00 AM (KST) None
VenusBench-Mobile: A Challenging and User-Centric Benchmark for Mobile GUI Agents with Capability Diagnostics
Yichen Gong ⋅ Zhuohan Cai ⋅ Sunhao Dai ⋅ Yuqi Zhou ⋅ Zhangxuan Gu ⋅ Changhua Meng ⋅ Shuheng Shen
[ OpenReview