Why DDIM Hallucinates More than DDPM: A Theoretical Analysis of Reverse Dynamics
Muhammad Ashiq ⋅ Samanyu Arora ⋅ Abhinav Narayan Harish ⋅ Ishaan Kharbanda ⋅ Hung Yun Tseng ⋅ Grigorios Chrysos
Abstract
We theoretically study the hallucination phenomenon in two canonical diffusion samplers: the stochastic Denoising Diffusion Probabilistic Model (DDPM) and the deterministic Denoising Diffusion Implicit Model (DDIM). We analyze the reverse ODE (DDIM) and the reverse SDE (DDPM) for a Gaussian mixture target and prove that, after a critical time $\tau$, (a) DDIM can become stuck on the segment connecting the two nearest modes, and (b) the *stochasticity of DDPM* helps it escape this region, thus avoiding hallucination. Our empirical validation confirms that DDPM has a significantly lower hallucination rate than DDIM once this region is entered. Building on these observations, we show how additional stochastic steps can help DDIM avoid hallucinations, and we offer new insights into the design of improved samplers.
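To make the two reverse dynamics concrete, the following is a minimal sketch, not the paper's setup. It assumes a one-dimensional, two-mode Gaussian mixture, a variance-preserving forward process with constant noise rate `beta`, and the closed-form score of the noised mixture; the names `score`, `sample`, and `between_modes_fraction`, and all parameter values, are hypothetical. Because the exact score is used, the sketch only contrasts the deterministic (probability-flow ODE, DDIM-like) and stochastic (reverse SDE, DDPM-like) update rules and is not expected to reproduce the reported hallucination gap.

```python
import numpy as np


def score(x, t, mu=2.0, sigma=0.2, beta=1.0):
    """Closed-form score of the noised mixture 0.5*N(-mu, sigma^2) + 0.5*N(mu, sigma^2).

    Assumes a variance-preserving forward process with constant rate beta,
    so the signal scale at time t is a = exp(-0.5 * beta * t).
    """
    a = np.exp(-0.5 * beta * t)
    m = a * mu                            # noised mode location
    v = (a * sigma) ** 2 + 1.0 - a ** 2   # noised per-mode variance
    # Equal weights and variances: the posterior mode mean is m * tanh(x * m / v).
    return (m * np.tanh(x * m / v) - x) / v


def sample(n, stochastic, steps=1000, T=8.0, mu=2.0, sigma=0.2, beta=1.0, seed=0):
    """Integrate the reverse dynamics from t = T down to t = 0 with Euler steps.

    stochastic=False: probability-flow ODE (deterministic, DDIM-like).
    stochastic=True:  reverse SDE via Euler-Maruyama (DDPM-like).
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(n)            # approximate prior at t = T
    dt = T / steps
    for i in range(steps):
        t = T - i * dt
        s = score(x, t, mu=mu, sigma=sigma, beta=beta)
        if stochastic:
            noise = rng.standard_normal(n)
            x = x + (0.5 * beta * x + beta * s) * dt + np.sqrt(beta * dt) * noise
        else:
            x = x + (0.5 * beta * x + 0.5 * beta * s) * dt
    return x


def between_modes_fraction(x, mu=2.0, sigma=0.2, k=3.0):
    """Fraction of samples strictly between the modes and more than k*sigma from both."""
    return float(np.mean(np.abs(x) < mu - k * sigma))


if __name__ == "__main__":
    x_ode = sample(2000, stochastic=False)
    x_sde = sample(2000, stochastic=True)
    print("ODE between-modes fraction:", between_modes_fraction(x_ode))
    print("SDE between-modes fraction:", between_modes_fraction(x_sde))
```

The only difference between the two branches is the factor on the score term and the injected noise, which is the stochasticity the abstract credits with helping DDPM leave the region between modes. Replacing `score` with a learned approximation would be the natural next step for probing the hallucination behavior itself.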