Mind the Gap: Catching Hallucinations via Evidence Drop on the Reasoning Manifold
Abstract
Large Language Models (LLMs) show strong reasoning abilities, yet their reliability is undermined by hallucinations, in which fluent reasoning turns factually or logically incorrect. Most existing uncertainty-based detectors rely on sequence-level averaging, which ignores the step-wise dynamics of reasoning and often misclassifies hard-but-correct or easy-but-wrong samples. We propose a dynamic perspective that models reasoning as a trajectory on a latent \emph{Evidence Manifold}, where each step is supported by local evidence. Hallucinations are characterized as \emph{Evidence Drops}, i.e., sudden declines in local evidence support that signal topological deviations from this manifold. Building on this insight, we design a training-free, model-agnostic detector that identifies hallucinations via the worst-case Evidence Drop and enables step-level error localization. Experiments on GSM8K, MATH, and ProcessBench show consistent improvements over sequence-level uncertainty baselines in selective accuracy and risk–coverage trade-offs.
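As a rough formalization of the detection criterion sketched above (the notation here is an illustrative assumption on our part, not the paper's own definition), let $e_t$ denote the local evidence support at reasoning step $t$. The worst-case Evidence Drop over a trajectory of $T$ steps can then be written as
\[
  \Delta^\star \;=\; \max_{2 \le t \le T} \bigl(e_{t-1} - e_t\bigr),
\]
with the trajectory flagged as hallucinated when $\Delta^\star$ exceeds a threshold $\tau$, and the maximizing step $t^\star = \arg\max_t (e_{t-1} - e_t)$ serving as the step-level error location.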