(4 events)
The 2026 schedule is still incomplete
Oral
Thu Jul 09 10:00 AM -- 10:15 AM (KST)
Position: There are futures that benchmark-driven AI cannot see
Sobhan Lotfi ⋅ Ava Iranmanesh ⋅ Lachin Naghashyar ⋅ Ali Shirali ⋅ Fateme Haredasht ⋅ Sanmi Koyejo ⋅ Phil Torr ⋅ Yong Suk Lee ⋅ Fazl Barez ⋅ Joel Lehman ⋅ Peter Norvig ⋅ Arvind Narayanan
Oral
Thu Jul 09 10:15 AM -- 10:30 AM (KST)
CausalGame: Benchmarking Causal Thinking of LLM Agents in Games
Zhenhao Chen ⋅ Yongqiang Chen ⋅ Chenxi Liu ⋅ Junchi Yu ⋅ Xiangchen Song ⋅ Zijian Li ⋅ Jialin Li ⋅ Phil Torr ⋅ Bo Han ⋅ Kun Zhang
Oral
Thu Jul 09 10:30 AM -- 10:45 AM (KST)
Characterizing, Evaluating, and Optimizing Complex Reasoning
Haoran Zhang ⋅ Yafu Li ⋅ Zhi Wang ⋅ Zhilin Wang ⋅ Shunkai Zhang ⋅ Xiaoye Qu ⋅ Yu Cheng
Oral
Thu Jul 09 10:45 AM -- 11:00 AM (KST)
Rare Event Analysis of Large Language Models
Jake McAllister Dorman ⋅ Edward Gillman ⋅ Dominic C Rose ⋅ Jamie Mair ⋅ Juan Garrahan