Causal Discovery with Language Models as Imperfect Experts
Abstract: Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, we explore how expert knowledge can be used to improve the data-driven identification of causal graphs, beyond Markov equivalence classes. In doing so, we consider a setting where we can query an expert about the orientation of causal relationships between variables, but where the expert may provide erroneous information. We propose strategies for amending such expert knowledge based on consistency properties, e.g., acyclicity and conditional independencies in the equivalence class. We then report a case study, on real data, where a LLM is used as an imperfect expert.
- On the completeness of causal discovery in the presence of latent confounding with tiered background knowledge. In Chiappa, S. and Calandra, R. (eds.), Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, volume 108 of Proceedings of Machine Learning Research, pp. 4002–4011. PMLR, 26–28 Aug 2020. URL https://proceedings.mlr.press/v108/andrews20a.html.
- Constitutional ai: Harmlessness from ai feedback. arXiv preprint arXiv:2212.08073, 2022.
- The alarm monitoring system: A case study with two probabilistic inference techniques for belief networks. pp. 247–256, 1989.
- Adaptive probabilistic networks with hidden variables. Machine Learning, 29(2-3):213–244, 1997.
- Differentiable causal discovery from interventional data. Advances in Neural Information Processing Systems, 33:21865–21877, 2020.
- Typing assumptions improve identification in causal discovery. In Conference on Causal Learning and Reasoning, pp. 162–177. PMLR, 2022.
- Learning bayesian networks with ancestral constraints. Advances in Neural Information Processing Systems, 29, 2016.
- Chickering, D. M. Optimal structure identification with greedy search. Journal of machine learning research, 3(Nov):507–554, 2002.
- Lmpriors: Pre-trained language models as task-specific priors. arXiv preprint arXiv: 2210.12530, 2022.
- The impact of prior knowledge on causal structure learning. Knowledge and Information Systems, pp. 1–50, 2023.
- Bayesian network learning algorithms using structural restrictions. International Journal of Approximate Reasoning, 45(2):233–254, 2007.
- On the number of experiments sufficient and in the worst case necessary to identify all causal relations among n variables. In Conference on Uncertainty in Artificial Intelligence, 2005.
- Review of causal discovery methods based on graphical models. Frontiers in Genetics, 10, 2019. ISSN 1664-8021. doi: 10.3389/fgene.2019.00524. URL https://www.frontiersin.org/articles/10.3389/fgene.2019.00524.
- Investigating causal understanding in llms. 2022.
- Language models (mostly) know what they know. arXiv preprint arXiv:2207.05221, 2022.
- Causal reasoning and large language models: Opening a new frontier for causality. arXiv preprint arXiv:2305.00050, 2023.
- Local computation with probabilities on graphical structures and their application to expert systems (with discussion). Journal of the Royal Statistical Society: Series B (Statistical Methodology), 50(2):157–224, 1988.
- Bayesian network structure learning with side constraints. In International conference on probabilistic graphical models, pp. 225–236. PMLR, 2018.
- Can large language models build causal graphs? arXiv preprint arXiv: 2303.05279, 2023.
- Estimating high-dimensional intervention effects from observational data. The Annals of Statistics, 37(6A):3133 – 3164, 2009. doi: 10.1214/09-AOS685. URL https://doi.org/10.1214/09-AOS685.
- Meek, C. Causal inference and causal explanation with background knowledge. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, UAI’95, pp. 403–410, San Francisco, CA, USA, 1995. Morgan Kaufmann Publishers Inc. ISBN 1558603859.
- Distinguishing cause from effect using observational data: Methods and benchmarks. Journal of Machine Learning Research, 17(32):1–102, 2016. URL http://jmlr.org/papers/v17/14-518.html.
- Joint causal inference from multiple contexts. The Journal of Machine Learning Research, 21(1):3919–4026, 2020.
- Repair of partly misspecified causal diagrams. Epidemiology, 28, 2017.
- OpenAI. Gpt-4 technical report, 2023a.
- OpenAI, R. Gpt-4 technical report. arXiv, pp. 2303–08774, 2023b.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744, 2022.
- Elements of causal inference: foundations and learning algorithms. The MIT Press, 2017.
- Inferring causation from time series in earth system sciences. Nature communications, 10(1):2553, 2019.
- Causal protein-signaling networks derived from multiparameter single-cell data. Science, 308(5721):523–529, 2005.
- The tetrad project: Constraint based aids to causal model specification. Multivariate Behavioral Research, 33(1):65–117, 1998.
- Scutari, M. Learning bayesian networks with the bnlearn R package. Journal of Statistical Software, 35(3):1–22, 2010. doi: 10.18637/jss.v035.i03.
- Learning in probabilistic expert systems. pp. 447–466, 1992.
- Constructing bayesian network models of gene expression networks from microarray data. 2000.
- Causal inference in the presence of latent variables and selection bias. arXiv preprint arXiv:1302.4983, 2013.
- Causal-discovery performance of chatgpt in the context of neuropathic pain diagnosis. 2023.
- Can foundation models talk causality? arXiv preprint arXiv:2206.10591, 2022.
- Dags with no tears: Continuous optimization for structure learning. Advances in neural information processing systems, 31, 2018.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.