Causal discovery for time series with endogenous context variables
Abstract
Many real-world systems exhibit both context- and time-dependent causal dynamics, where the dynamical system state also influences its context. For instance, soil moisture is driven by precipitation, yet also provides the context for heat-flux realization. We capture such dynamics in Structural Causal Models (SCMs) by introducing endogenous and time-dependent discrete context variables, also allowing for possibly lagged dependencies with the system variables. While context variables are discrete, they may also be proxies of continuous variables. The enabling assumptions for causal discovery of our model are either persistence of the context or sparsity of the context–system dependencies. We design two new PCMCI-based algorithms for causal discovery with endogenous context variables for time series and prove their soundness. A systematic evaluation on synthetic benchmarks and an application to a real-world land-atmosphere feedback problem demonstrate their effectiveness and applicability.