Skip to yearly menu bar Skip to main content


Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models’ Reasoning Performance

Yao Fu ⋅ Litu Ou ⋅ Yuhao Wan ⋅ Mingyu Chen ⋅ Hao Peng ⋅ Tushar Khot

Abstract

Video

Chat is not available.