Inference of Causal Networks using a Topological Threshold
Abstract: We propose a constraint-based algorithm, which automatically determines causal relevance thresholds, to infer causal networks from data. We call these topological thresholds. We present two methods for determining the threshold: the first seeks a set of edges that leaves no disconnected nodes in the network; the second seeks a causal large connected component in the data. We tested these methods both for discrete synthetic and real data, and compared the results with those obtained for the PC algorithm, which we took as the benchmark. We show that this novel algorithm is generally faster and more accurate than the PC algorithm. The algorithm for determining the thresholds requires choosing a measure of causality. We tested our methods for Fisher Correlations, commonly used in PC algorithm (for instance in \cite{kalisch2005}), and further proposed a discrete and asymmetric measure of causality, that we called Net Influence, which provided very good results when inferring causal networks from discrete data. This metric allows for inferring directionality of the edges in the process of applying the thresholds, speeding up the inference of causal DAGs.
- Estimating high-dimensional directed acyclic graphs with the pc-algorithm, 2005.
- Causal discovery in manufacturing: A structured literature review. Journal of Manufacturing and Materials Processing, 6(1):10, 2022.
- An influence-based approach for root cause alarm discovery in telecom networks. In International Conference on Service-Oriented Computing, pages 124–136. Springer, 2020.
- Modeling cumulative biological phenomena with suppes-bayes causal networks. Evolutionary Bioinformatics, 14:1176934318785167, 2018.
- Causal networks in EIA. Environmental Impact Assessment Review, 26(6):553–569, 2006.
- Inferring causation from time series in earth system sciences. Nature communications, 10(1):2553, 2019.
- Applications of bayesian networks in meteorology. In Advances in Bayesian networks, pages 309–328. Springer, 2004.
- Judea Pearl. Bayesian networks: A model of self-activated memory for evidential reasoning. In Report (University of California, Los Angeles. Computer Science Dept.), 1985.
- Judea Pearl. Causality: Models, Reasoning and Inference. Cambridge University Press, 2nd edition, 2009.
- d-separation: From theorems to algorithms, 2013.
- Directed hypergraphs: Introduction and fundamental algorithms—a survey. Theoretical Computer Science, 658:293–306, 2017. Horn formulas, directed hypergraphs, lattices and closure systems: related formalism and application.
- Equivalence and synthesis of causal models. In Proceedings of the Sixth Annual Conference on Uncertainty in Artificial Intelligence, UAI ’90, page 255–270, USA, 1990. Elsevier Science Inc.
- DAGs with NO TEARS: Continuous optimization for structure learning, 2018.
- A fast PC algorithm for high dimensional causal discovery with multi-core PCs. IEEE/ACM Trans. Comput. Biol. Bioinformatics, 16(5):1483–1495, sep 2019.
- Learning bayesian network structures using weakest mutual-information-first strategy. International Journal of Approximate Reasoning, 114:84–98, 2019.
- Causal inference via sparse additive models with application to online advertising. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1), 2 2015.
- An efficient bayesian network structure learning algorithm based on structural information. Swarm and Evolutionary Computation, 76:101224, 2023.
- Empirical evaluation of scoring functions for bayesian network model selection. BMC Bioinformatics, 13(S15), sep 2012.
- A bayesian method for the induction of probabilistic networks from data - machine learning. SpringerLink, 1992.
- Causation, Prediction, and Search. MIT press, 2nd edition, 2000.
- Bayesian network induction via local neighborhoods. Advances in neural information processing systems, 12, 1999.
- Efficient markov network structure discovery using independence tests. J. Artif. Int. Res., 35(1):449–484, jul 2009.
- Learning bayesian networks from data: An information-theory based approach. Artificial Intelligence, 137(1):43–90, 2002.
- The max-min hill-climbing bayesian network structure learning algorithm. Machine learning, 65:31–78, 2006.
- Consistent learning bayesian networks with thousands of variables. In Antti Hyttinen, Joe Suzuki, and Brandon Malone, editors, Proceedings of The 3rd International Workshop on Advanced Methodologies for Bayesian Networks, volume 73 of Proceedings of Machine Learning Research, pages 57–68. PMLR, 20–22 Sep 2017.
- A bayesian approach to constraint based causal inference. arXiv preprint arXiv:1210.4866, 2012.
- The hardness of conditional independence testing and the generalised covariance measure. The Annals of Statistics, 48(3):1514–1538, 2020.
- Testing conditional independence in supervised learning algorithms. Machine Learning, 110(8):2107–2129, 2021.
- A model of inexact reasoning in medicine. Mathematical biosciences, 23(3-4):351–379, 1975.
- Knee/elbow estimation based on first derivative threshold. In 2018 IEEE Fourth International Conference on Big Data Computing Service and Applications (BigDataService), pages 237–240, 2018.
- gcastle: A python toolbox for causal discovery, 2021.
- Brian W Matthews. Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochimica et Biophysica Acta (BBA)-Protein Structure, 405(2):442–451, 1975.
- Local computations with probabilities on graphical structures and their application to expert systems. Journal of the Royal Statistical Society. Series B (Methodological), 50(2):157–224, 1988.
- Directed acyclic graph neural networks, 2021.
- The alarm monitoring system: A case study with two probabilistic inference techniques for belief networks. In Jim Hunter, John Cookson, and Jeremy Wyatt, editors, AIME 89, pages 247–256, Berlin, Heidelberg, 1989. Springer Berlin Heidelberg.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.