DeepRepViz: Identifying Confounders in Deep Learning Model Predictions
Abstract: Deep Learning (DL) models have gained popularity in neuroimaging studies for predicting psychological behaviors, cognitive traits, and brain pathologies. However, these models can be biased by confounders such as age, sex, or imaging artifacts from the acquisition process. To address this, we introduce 'DeepRepViz', a two-part framework designed to identify confounders in DL model predictions. The first component is a visualization tool that can be used to qualitatively examine the final latent representation of the DL model. The second component is a metric called 'Con-score' that quantifies the confounder risk associated with a variable, using the final latent representation of the DL model. We demonstrate the effectiveness of the Con-score using a simple simulated setup by iteratively altering the strength of a simulated confounder and observing the corresponding change in the Con-score. Next, we validate the DeepRepViz framework on a large-scale neuroimaging dataset (n=12000) by performing three MRI-phenotype prediction tasks that include (a) predicting chronic alcohol users, (b) classifying participant sex, and (c) predicting performance speed on a cognitive task called 'trail making'. DeepRepViz identifies sex as a significant confounder in the DL model predicting chronic alcohol users (Con-score=0.35) and age as a confounder in the model predicting cognitive task performance (Con-score=0.3). In conclusion, the DeepRepViz framework provides a systematic approach to test for potential confounders such as age, sex, and imaging artifacts and improves the transparency of DL models for neuroimaging studies.
- “Risk of training diagnostic algorithms on data with demographic bias” In Interpretable and Annotation-Efficient Learning for Medical Image Computing Springer, 2020, pp. 183–192
- Yoshua Bengio, Aaron Courville and Pascal Vincent “Representation learning: A review and new perspectives” In IEEE transactions on pattern analysis and machine intelligence 35.8 IEEE, 2013, pp. 1798–1828
- “Algorithmic fairness in artificial intelligence for medicine and healthcare” In Nature Biomedical Engineering 7.6 Nature Publishing Group UK London, 2023, pp. 719–742
- “Promises and pitfalls of deep neural networks in neuroimaging-based psychiatric research” In Experimental Neurology 339 Elsevier, 2021, pp. 113608
- “Algorithmic encoding of protected characteristics in image-based models for disease detection” In arXiv preprint arXiv:2110.14755, 2021
- Ian Goodfellow, Yoshua Bengio and Aaron Courville “Deep Learning” http://www.deeplearningbook.org MIT Press, 2016
- “The same analysis approach: Practical protection against the pitfalls of novel neuroimaging analysis methods” In Neuroimage 180 Elsevier, 2018, pp. 19–30
- “Sex and age differences in atrophic rates: an ADNI study with n= 1368 MRI scans” In Neurobiology of aging 31.8 Elsevier, 2010, pp. 1463–1480
- “The quandary of covarying: A brief review and empirical examination of covariate use in structural neuroimaging studies on psychological variables” In NeuroImage 205, 2020, pp. 116225 DOI: https://doi.org/10.1016/j.neuroimage.2019.116225
- “Deep learning for Alzheimer’s disease diagnosis: A survey” In Artificial Intelligence in Medicine 130 Elsevier, 2022, pp. 102332
- “Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav)” In International conference on machine learning, 2018, pp. 2668–2677 PMLR
- “The Confound Continuum: A 2D confounder assessment for AI in precision medicine”, 2023
- Richard D McKelvey and William Zavoina “A statistical model for the analysis of ordinal level dependent variables” In Journal of mathematical sociology 4.1 Taylor & Francis, 1975, pp. 103–120
- Nico JD Nagelkerke “A note on a general definition of the coefficient of determination” In Biometrika 78.3 Citeseer, 1991, pp. 691–692
- Tomáš Paus “Population neuroscience: why and how” In Human brain mapping 31.6 Wiley Online Library, 2010, pp. 891–903
- “Identification of causal effects of neuroanatomy on cognitive decline requires modeling unobserved confounders” In Alzheimer’s & Dementia 19.5 Wiley Online Library, 2023, pp. 1994–2005
- Roshan Prakash Rane, Andreas Heinz and Kerstin Ritter “AIM in Alcohol and Drug Dependence” In Artificial Intelligence in Medicine Springer, 2022, pp. 1619–1628
- “Eating-related variables partially explain the prospective prediction of binge drinking from structural brain features” PsyArXiv, 2023
- “Structural differences in adolescent brains can predict alcohol misuse” In Elife 11 eLife Sciences Publications Limited, 2022, pp. e77545
- “Predictive modelling using neuroimaging data in the presence of confounds” In Neuroimage 150 Elsevier, 2017, pp. 23–49
- Ralph M Reitan “Validity of the Trail Making Test as an indicator of organic brain damage” In Perceptual and motor skills 8.3 SAGE Publications Sage CA: Los Angeles, CA, 1958, pp. 271–276
- “Underdiagnosis bias of artificial intelligence algorithms applied to chest radiographs in under-served patient populations” In Nature medicine 27.12 Nature Publishing Group, 2021, pp. 2176–2182
- Lukas Snoek, Steven Miletić and H Steven Scholte “How to control for confounds in decoding analyses of neuroimaging data” In NeuroImage 184 Elsevier, 2019, pp. 741–760
- Tamas Spisak “Statistical quantification of confounding bias in machine learning models” In GigaScience 11 Oxford Academic, 2022
- “UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age” In PLoS medicine 12.3 Public Library of Science, 2015, pp. e1001779
- “MRI field strength predicts Alzheimer’s disease: a case example of bias in the ADNI data set” In 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), 2022, pp. 1–4 IEEE
- Sandra Vieira, Walter HL Pinaya and Andrea Mechelli “Using deep learning to investigate the neuroimaging correlates of psychiatric and neurological disorders: Methods and applications” In Neuroscience & Biobehavioral Reviews 74 Elsevier, 2017, pp. 58–75
- “Predicting sex, age, general cognition and mental health with machine learning on brain structural connectomes” In Human Brain Mapping 44.5 Wiley Online Library, 2023, pp. 1913–1933
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.