Papers
Topics
Authors
Recent
Search
2000 character limit reached

Adaptive FDP Estimators

Updated 17 December 2025
  • Adaptive FDP estimators are statistical procedures that use data-driven methods to accurately estimate the false discovery proportion while ensuring finite-sample guarantees.
  • They employ adaptive techniques, including linear step-up procedures, competition-based bounds, and high-dimensional regression adjustments, to improve power over classical methods.
  • Empirical evaluations and theoretical guarantees demonstrate that these estimators offer tight simultaneous confidence envelopes and scalability, even under differential privacy constraints.

An adaptive FDP estimator is a statistical procedure or formula that estimates or bounds the false discovery proportion (FDP), defined as the ratio V/RV/R of the number VV of false discoveries (true nulls rejected) to the total number RR of rejections, in a way that leverages data-driven or parameter-adaptive choices, often with finite-sample and/or distributional guarantees. The recent literature examines adaptive FDP estimation under classical multiple testing (via p-values), competition-based FDR control (e.g., knockoffs), high-dimensional regression, and differential privacy, with several lines of inquiry on estimator consistency, simultaneous confidence envelopes, post-hoc flexibility, and efficiency.

1. Fundamental Definitions and Context

Let H1,,HmH_1,\dots,H_m denote mm null hypotheses tested simultaneously. Given a selection rule that rejects RR hypotheses (among which VV are falsely rejected true nulls), the false discovery proportion is

FDP=VR(0/0:=0),\text{FDP} = \frac{V}{R} \quad (0/0 := 0),

and the false discovery rate is FDR=E[FDP]\text{FDR} = \mathbb{E}[\text{FDP}]. Adaptive FDP estimation is distinguished by procedures that depend on data-driven estimators of nuisance parameters, typically the proportion of true nulls, or involve post hoc bounds informed by the observed data structure (Ditzhaus et al., 2018, Ebadi et al., 2023, Hemerik et al., 2022, Jeng et al., 2018).

2. Adaptive FDP Estimators in Classical Multiple Testing

Adaptive linear step-up procedures, generalizing the Benjamini-Hochberg (BH) procedure, use an estimator m^0\widehat m_0 for the (unknown) number VV0 of true nulls. This enables more powerful testing by adapting critical values: VV1 with VV2 a tuning constant. Estimating VV3 (e.g., via convex combinations of generalized Storey-type estimators) and restricting to p-values below VV4 yields exact formulas for all moments of the FDP: VV5

VV6

and similarly for higher moments (Ditzhaus et al., 2018). Estimator stability and abundance of rejections are necessary and sufficient for consistency (VV7 in probability).

A large class of adaptive estimators involves bin-wise partitions VV8 and weights VV9,

RR0

These are shown to control FDR at finite RR1, with strictly improved asymptotic FDR and power over BH when RR2 (Ditzhaus et al., 2018).

3. Data-Adaptive FDP Bounds and Simultaneous Envelopes

Recent developments provide median-unbiased and simultaneous envelopes for the FDP, valid over entire rejection paths and allowing post hoc choice of target FDP (denoted RR3). Under symmetry and mild stochastic ordering of true null p-values, one has

RR4

yielding the adaptive bound

RR5

with RR6. To validly select RR7 post hoc, the envelope is strengthened: RR8 ensuring RR9 (Hemerik et al., 2022). These approaches yield computationally efficient algorithms (linear time after sorting) and mFDP-controlling procedures, with enhanced flexibility and interpretable adjusted p-values.

4. Competition-Based Adaptive FDP Bounds

In competition-based FDR control (e.g., knockoffs, target-decoy competition), adaptive FDP bounds are constructed via negative-binomial processes. For each hypothesis H1,,HmH_1,\dots,H_m0, one defines target and decoy wins, with the label H1,,HmH_1,\dots,H_m1 or H1,,HmH_1,\dots,H_m2. For any cutoff H1,,HmH_1,\dots,H_m3, denote H1,,HmH_1,\dots,H_m4 the decoy count and H1,,HmH_1,\dots,H_m5 the target wins, with FDP H1,,HmH_1,\dots,H_m6.

Two principal adaptive upper bands for FDP are developed:

  • Standardized band (TDC-SB):

H1,,HmH_1,\dots,H_m7

with H1,,HmH_1,\dots,H_m8 and H1,,HmH_1,\dots,H_m9 the quantile of the standardized process.

  • Uniform band (TDC-UB):

mm0

where mm1 is the largest threshold guaranteeing mm2.

Both bands are shown to adapt tightly to the data (especially for small decoy counts) and empirically outperform the Katsevich–Ramdas bound in diverse settings, maintaining finite-sample exactness and scalability (Ebadi et al., 2023).

5. High-Dimensional Regression: Adaptive FDP Estimation

In high-dimensional regression, the de-sparsified Lasso (“DLasso”) estimator provides adaptive FDP estimation for variable selection. The statistic

mm3

is used to rank predictors. The plug-in FDP estimator

mm4

approximates the expected number of false discoveries via Normal tail probabilities. The threshold mm5 yields consistent FDP control under standard design and sparsity assumptions (Jeng et al., 2018).

6. Adaptive Estimation under Differential Privacy Constraints

Federated differential privacy (FDP) introduces new challenges for adaptation. In federated density estimation, servers add carefully tuned exponential noise in a multiscale oscillation norm to wavelet coefficient estimates, yielding (ε,0)-FDP privacy: mm6 Post-processing via block thresholding produces estimators attaining sharp adaptive rates: mm7 with analogous bounds for pointwise estimation. Lower bounds demonstrate that adaptation in global risk under FDP incurs an intrinsic mm8 factor in the privacy term, and the pointwise risk incurs two such factors, reflecting the unavoidable privacy-adaptation trade-off (Cai et al., 16 Dec 2025).

7. Comparative Properties and Empirical Performance

Adaptive FDP estimators expand on classical FDR control by:

  • Providing exact finite-sample moment formulas for the FDP (Ditzhaus et al., 2018).
  • Enabling post hoc selection of target FDP levels with simultaneous envelope guarantees (Hemerik et al., 2022).
  • Offering tighter bounds, especially at low decoy counts, when benchmarking against existing simultaneous FDP-control bands (Ebadi et al., 2023).
  • Achieving consistency and minimax rates, even in challenging regimes (high-dimensional, privacy constrained), where classical approaches may suffer conservatism or inefficiency (Jeng et al., 2018, Cai et al., 16 Dec 2025).

Empirical evaluations confirm that adaptive procedures maintain nominal control and improve power or tightness against competitors. A plausible implication is that envelope-based or negative-binomial-process bounds can yield sharper guarantees and computational tractability, even in the presence of dependence or unknown null proportions.


In summary, adaptive FDP estimators encompass a broad family of data-driven procedures for simultaneous multiple testing, variable selection, competition frameworks, and privacy-preserving inference. They achieve finite-sample validity, enhanced flexibility, and improved performance relative to traditional mean-FDP control, with rigorous consistency and minimax adaptivity in diverse statistical models.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Adaptive FDP Estimator.