Powerful Large-scale Inference in High Dimensional Mediation Analysis
Abstract: In genome-wide epigenetic studies, exposures (e.g., Single Nucleotide Polymorphisms) affect outcomes (e.g., gene expression) through intermediate variables such as DNA methylation. Mediation analysis offers a way to study these intermediate variables and identify the presence or absence of causal mediation effects. Testing for mediation effects lead to a composite null hypothesis. Existing methods like the Sobel's test or the Max-P test are often underpowered because 1) statistical inference is often conducted based on distributions determined under a subset of the null and 2) they are not designed to shoulder the multiple testing burden. To tackle these issues, we introduce a technique called MLFDR (Mediation Analysis using Local False Discovery Rates) for high dimensional mediation analysis, which uses the local False Discovery Rates based on the coefficients of the structural equation model specifying the mediation relationship to construct a rejection region. We have shown theoretically as well as through simulation studies that in the high-dimensional setting, the new method of identifying the mediating variables controls the FDR asymptotically and performs better with respect to power than several existing methods such as DACT (Liu et al.)and JS-mixture (Dai et al).
- The moderator–mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of personality and social psychology 51, 1173.
- Optimal false discovery rate control for large scale multiple testing with auxiliary information.
- A multiple-testing procedure for high-dimensional mediation hypotheses. Journal of the American Statistical Association 117, 198–213.
- Two-component mixture model in the presence of covariates. Journal of the American Statistical Association 117, 1820–1834.
- Dna methylation and regulation of gene expression: Guardian of our health. The Nucleus 64, 259–270.
- Age-related gene expression changes, and transcriptome wide association study of physical and cognitive aging traits, in the lothian birth cohort 1936. Aging (Albany NY) 9, 2489.
- On strong identifiability and convergence rates of parameter estimation in finite mixtures.
- Dna methylation and healthy human aging. Aging cell 14, 924–932.
- Direct and indirect effects in a survival context. Epidemiology pages 575–581.
- Large-scale hypothesis testing for causal mediation effects with applications in genome-wide epigenetic studies. Journal of the American Statistical Association 117, 67–81.
- Smoking-related changes in dna methylation and gene expression are associated with cardio-metabolic traits. Clinical epigenetics 12, 1–16.
- A comparison of methods to test mediation and other intervening variable effects. Psychological methods 7, 83.
- Dna methylation and its basic function. Neuropsychopharmacology 38, 23–38.
- Naaman, M. (2021). On the tight constant in the multivariate dvoretzky–kiefer–wolfowitz inequality. Statistics & Probability Letters 173, 109088.
- Nguyen, X. (2013). Convergence of latent mixing measures in finite and infinite mixture models.
- Pearl, J. (2012). The causal mediation formula—a guide to the assessment of pathways and mechanisms. Prevention science 13, 426–436.
- Richardson, B. (2003). Impact of aging on dna methylation. Ageing research reviews 2, 245–261.
- Identifiability and exchangeability for direct and indirect effects. Epidemiology 3, 143–155.
- Sobel, M. E. (1982). Asymptotic confidence intervals for indirect effects in structural equation models. Sociological methodology 13, 290–312.
- Gene expression becomes heterogeneous with age. Current Biology 16, R359–R360.
- Storey, J. D. (2002). A direct approach to false discovery rates. Journal of the Royal Statistical Society Series B: Statistical Methodology 64, 479–498.
- Oracle and adaptive compound decision rules for false discovery rate control. Journal of the American Statistical Association 102, 901–912.
- Tchetgen Tchetgen, E. J. (2011). On causal mediation analysis with a survival outcome. The international journal of biostatistics 7, 0000102202155746791351.
- Mediation analysis allowing for exposure–mediator interactions and causal interpretation: theoretical assumptions and implementation with sas and spss macros. Psychological methods 18, 137.
- VanderWeele, T. J. (2016). Mediation analysis: a practitioner’s guide. Annual review of public health 37, 17–32.
- Odds ratios for mediation analysis for a dichotomous outcome. American journal of epidemiology 172, 1339–1348.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.