Timezone: »
We discover that neural networks exhibit approximate logical dependencies among neurons, and we introduce Neuron Dependency Graphs (NDG) that extract and present them as directed graphs. In an NDG, each node corresponds to the boolean activation value of a neuron, and each edge models an approximate logical implication from one node to another. We show that the logical dependencies extracted from the training dataset generalize well to the test set. In addition to providing symbolic explanations to the neural network's internal structure, NDGs can represent a Structural Causal Model. We empirically show that an NDG is a causal abstraction of the corresponding neural network that "unfolds" the same way under causal interventions using the theory by Geiger et al. (2021). Code is available at https://github.com/phimachine/ndg.
Author Information
Yaojie Hu (Iowa State University)
Jin Tian (Iowa State University)
Related Events (a corresponding poster, oral, or spotlight)
-
2022 Poster: Neuron Dependency Graphs: A Causal Abstraction of Neural Networks »
Tue. Jul 19th through Wed the 20th Room Hall E #905
More from the Same Authors
-
2023 Poster: Instrumental Variable Estimation of Average Partial Causal Effects »
Yuta Kawakami · manabu kuroki · Jin Tian -
2023 Poster: Estimating Joint Treatment Effects by Combining Multiple Experiments »
Yonghan Jung · Jin Tian · Elias Bareinboim -
2022 Poster: Partial Counterfactual Identification from Observational and Experimental Data »
Junzhe Zhang · Jin Tian · Elias Bareinboim -
2022 Poster: On Measuring Causal Contributions via do-interventions »
Yonghan Jung · Shiva Kasiviswanathan · Jin Tian · Dominik Janzing · Patrick Bloebaum · Elias Bareinboim -
2022 Spotlight: On Measuring Causal Contributions via do-interventions »
Yonghan Jung · Shiva Kasiviswanathan · Jin Tian · Dominik Janzing · Patrick Bloebaum · Elias Bareinboim -
2022 Spotlight: Partial Counterfactual Identification from Observational and Experimental Data »
Junzhe Zhang · Jin Tian · Elias Bareinboim -
2021 Poster: Estimating Identifiable Causal Effects on Markov Equivalence Class through Double Machine Learning »
Yonghan Jung · Jin Tian · Elias Bareinboim -
2021 Spotlight: Estimating Identifiable Causal Effects on Markov Equivalence Class through Double Machine Learning »
Yonghan Jung · Jin Tian · Elias Bareinboim -
2019 Poster: Adjustment Criteria for Generalizing Experimental Findings »
Juan Correa · Jin Tian · Elias Bareinboim -
2019 Oral: Adjustment Criteria for Generalizing Experimental Findings »
Juan Correa · Jin Tian · Elias Bareinboim