Papers
Topics
Authors
Recent
Search
2000 character limit reached

Sub-Weibull Distributions

Updated 22 December 2025
  • Sub-Weibull distributions are a family of probability models defined by stretched-exponential tail decay, generalizing both sub-Gaussian and sub-exponential types.
  • They enable sharp non-asymptotic moment inequalities and concentration bounds essential for analyzing high-dimensional, heavy-tailed data.
  • They are applied in robust covariance estimation, regression, and graphical models, with techniques for empirical tail index estimation playing a key role.

Sub-Weibull distributions constitute a parametrized family of probability distributions characterized by tails at least as light as, and potentially heavier than, the exponential decay of Weibull-type laws. They generalize both the sub-Gaussian and sub-exponential classes, interpolating between light-tailed and heavier, but still stretched-exponential, regimes. The framework of sub-Weibull random variables and vectors enables the development of non-asymptotic moment inequalities, sharp concentration results, and robust statistical tools for high-dimensional scenarios with heavy-tailed or non-sub-Gaussian data.

1. Formal Definitions and Characterizations

Sub-Weibull distributions can be defined and characterized via several equivalent formulations encompassing tail behavior, moment growth, Orlicz norms, and moment generating functions. For a real random variable XX, and tail index parameter θ>0\theta>0, XX is called sub-Weibull of order θ\theta (notation: XsubW(θ)X\sim\mathrm{subW}(\theta)) if any of the following (equivalent up to constants) conditions hold (Vladimirova et al., 2019, He, 19 Dec 2025, Zhang et al., 2021):

  • Tail Bound: a,b>0\exists a,b>0 such that

P(Xx)aexp(bx1/θ)x0\mathbb{P}(|X| \geq x) \leq a\exp(-b x^{1/\theta}) \quad \forall x\geq 0

  • Moment Growth: K>0\exists K>0 s.t.

(EXk)1/kKkθ,k1(\mathbb{E}|X|^k)^{1/k} \leq K k^\theta, \quad \forall k\geq 1

  • Orlicz Norm (ψα_\alpha):

Xψα:=inf{C>0:Eexp(X/Cα)2}<\|X\|_{\psi_\alpha} := \inf \left\{ C>0: \mathbb{E} \exp \left( |X/C|^\alpha \right) \leq 2 \right\} < \infty

with α=1/θ\alpha = 1/\theta.

  • MGF-Type (Orlicz) Condition: K>0\exists K>0 so that

Eexp((X/K)1/θ)2\mathbb{E} \exp\left( (|X|/K)^{1/\theta} \right) \leq 2

The sub-Weibull property is often indexed as subW(θ)\mathrm{subW}(\theta), or, equivalently, as the class Lψα\mathcal{L}_{\psi_\alpha} via the Orlicz norm.

A random vector XRdX\in \mathbb{R}^d is called sub-Weibull(α\alpha) if for all uSd1u\in S^{d-1}, X,u\langle X,u \rangle is sub-Weibull of order α\alpha.

2. Relation to Sub-Gaussian and Sub-Exponential Regimes

The sub-Weibull hierarchy recovers classical tail behaviors for special values of θ\theta (α\alpha), establishing a continuous spectrum of tail-decay and moment growth:

θ\theta (α=1/θ\alpha=1/\theta) Recovery Tail Behavior Moment Growth
θ=1/2\theta=1/2 (α=2\alpha=2) sub-Gaussian exp(cx2)\exp(-cx^2) (EXk)1/kk(\mathbb{E}|X|^k)^{1/k} \precsim \sqrt{k}
θ=1\theta=1 (α=1\alpha=1) sub-Exponential exp(cx)\exp(-cx) (EXk)1/kk(\mathbb{E}|X|^k)^{1/k} \precsim k
θ<1\theta<1 (α>1\alpha>1) heavier-tailed sub-Weibull (stretched exp) exp(cxα)\exp(-c x^\alpha) (EXk)1/kk1/α(\mathbb{E}|X|^k)^{1/k} \sim k^{1/\alpha}

The sub-Weibull family interpolates: as θ\theta increases, distributions accommodate heavier tails (but all moments remain finite and grow polynomially in kk; thus, tails remain lighter than power laws) (Vladimirova et al., 2019, He, 19 Dec 2025, Zhang et al., 2021).

3. Concentration Inequalities and Tail Bounds

Non-asymptotic concentration phenomena for sums of (possibly weighted) independent sub-Weibull random variables mirror classical Bernstein or Rosenthal bounds, but must also accommodate the heaviest allowed (stretched-exponential) deviations (Bong et al., 2023, Zhang et al., 2021):

  • Hoeffding/Bernstein-Type Tail: If X1,...,XnX_1, ..., X_n are independent, mean-zero, sub-Weibull(θ\theta) with common θ\theta, then for suitable constant KθK_\theta,

P(i=1nXix)exp((xnKθ)1/θ)\mathbb{P}\left( \left| \sum_{i=1}^n X_i \right| \geq x \right) \leq \exp \left( - \left( \frac{x}{n K_\theta} \right)^{1/\theta} \right)

  • Two-Regime (GBO) Inequality: For sum S=i=1naiXiS = \sum_{i=1}^n a_i X_i, the probability satisfies for all t0t\geq 0

P(Ss)2exp(min{s2Cb22,(sCbLn)θ})\mathbb{P}\left(|S| \geq s \right) \leq 2 \exp\left(- \min \left\{ \frac{s^2}{C\|b\|_2^2}, \left( \frac{s}{C\|b\|_{L_n}} \right)^\theta \right\}\right)

where the quadratic regime dominates for small ss and the stretched exponential for large ss (Bong et al., 2023, Zhang et al., 2021).

  • Moment and Tail Equivalences: The following equivalence holds, up to constants:

Xψα<P(X>t)2exp((t/K)α),(EXp)1/pKp1/α\|X\|_{\psi_\alpha} < \infty \quad\Longleftrightarrow\quad \mathbb{P}(|X| > t) \leq 2 \exp(- (t/K)^\alpha), \quad (\mathbb{E}|X|^p)^{1/p} \lesssim K p^{1/\alpha}

This allows the deployment of uniform high-probability tail bounds and error analysis for sums, projections, or quadratic forms involving heavy-tailed (but sub-Weibull) random variables.

4. Estimation of the Tail Index and Empirical Techniques

For practical data analysis, estimation of the tail parameter θ\theta (or α\alpha) is essential. For a random variable with a Weibull-type tail, the tt-quantile takes the form q(t)=λ[log(1t)]θq(t) = \lambda [ -\log(1-t) ]^\theta, yielding a log-quantile plot linear in log(log(1t))\log(-\log(1-t)) with slope θ\theta (Vladimirova et al., 2019):

  • Linear Regression Estimator: Order nn data points as Yn,nYn1,nY1,nY_{n,n}\ge Y_{n-1,n}\ge \ldots \ge Y_{1,n}, select the kk largest, and regress

vi=logYni+1,nui=loglog(n/i)v_i = \log Y_{n-i+1, n}\qquad u_i = \log \log(n/i)

Then, estimate θ^\hat{\theta} as the slope of linear regression of viv_i on uiu_i.

  • Moment Estimators: Compute empirical norms such as

sup1kkmax(1ni=1nXiθk/k!)1/(θk)\sup_{1\leq k \leq k_{max}} \left( \frac{1}{n} \sum_{i=1}^n |X_i|^{\theta k} / k! \right)^{1/(\theta k)}

as proxies for Orlicz- or moment-based sub-Weibull norms (Zhang et al., 2021).

  • Cross-validation for θ\theta: If θ\theta is unknown, tuning via cross-validation enables empirical model selection in successive applications.

5. Closure Properties and Algebraic Operations

The sub-Weibull classes enjoy several algebraic closure and order properties (Vladimirova et al., 2019, Zhang et al., 2021):

  • Inclusion: For 0<θ1θ20<\theta_1\leq \theta_2,

subW(θ1)subW(θ2)\mathrm{subW}(\theta_1) \subseteq \mathrm{subW}(\theta_2)

Heavier-tailed classes properly contain the lighter-tailed ones.

  • Sum/Addition: If XsubW(θ1)X\in \mathrm{subW}(\theta_1) and YsubW(θ2)Y\in \mathrm{subW}(\theta_2),

X+YsubW(max{θ1,θ2})X+Y \in \mathrm{subW}(\max\{\theta_1, \theta_2\})

  • Product: For XsubW(θ1)X\in \mathrm{subW}(\theta_1), YsubW(θ2)Y\in \mathrm{subW}(\theta_2),

XYsubW(θ1+θ2)XY \in \mathrm{subW}(\theta_1 + \theta_2)

  • Powers: If XsubW(θ)X\sim\mathrm{subW}(\theta), then XrsubW(θ/r)|X|^r\sim \mathrm{subW}(\theta/r), and Xrψθ/r=Xψθr\| |X|^r\|_{\psi_{\theta/r}} = \|X\|_{\psi_\theta}^r (Zhang et al., 2021).
  • Optimal Tail Index: If (EXk)1/kkθ(\mathbb{E}|X|^k)^{1/k} \asymp k^\theta as kk\to\infty, then θ\theta is minimal for which XsubW(θ)X\in\mathrm{subW}(\theta).

6. Statistical Applications and Examples

The sub-Weibull formalism underpins robust statistical inference for high-dimensional and heavy-tailed data (Zhang et al., 2021, Bong et al., 2023, He, 19 Dec 2025):

  • Covariance Estimation: For i.i.d. XiX_i with sub-Weibull(α\alpha) tails, norm-/spectrally-truncated estimators achieve empirical error Σ^ΣopΣopr(Σ)logN/N\|\hat\Sigma-\Sigma\|_{\mathrm{op}} \lesssim \|\Sigma\|_{\mathrm{op}}\sqrt{r(\Sigma)\log N / N}, preserving the sub-Gaussian rate even under significantly heavier marginal distributions (He, 19 Dec 2025).
  • Negative Binomial Regression: For covariate vectors XiX_i with θ<1\theta<1, the 2\ell_2-error of maximum-likelihood or Z-estimator coefficients admits sharp non-asymptotic risk bounds, with

β^nβ2poly(p,log(p/δ)n)log1/θ(np/δ)\|\hat\beta_n-\beta^*\|_2 \lesssim \text{poly}(\frac{p,\log(p/\delta)}{n})\cdot \log^{1/\theta}(np/\delta)

where pp is the dimension and nn the sample size (Zhang et al., 2021).

  • Random Matrix Theory: For isotropic matrices with sub-Weibull(θ\theta) rows, Bai–Yin-type spectral norm bounds and eigenvalue location properties extend, with deviations controlled by two-regime functions c[t/n+(t/n)1/θ]c[\sqrt{t/n} + (t/n)^{1/\theta}] (Zhang et al., 2021).
  • Graphical Models: Estimation of multiple precision/covariance matrices from high-dimensional data with sub-Weibull marginals attains sample complexity improvements, as the required nn scales nearly linearly in the number of models and dimension in contrast to quadratic dependence for truly heavy-tailed errors (Bong et al., 2023).
  • Bayesian Neural Networks: Deep units composed of Gaussian-weighted layers induce sub-Weibull marginals with tail index directly determined by network depth, with empirical confirmation by slope-of-log-quantile regression (Vladimirova et al., 2019).

7. Extensions, Limitations, and Ongoing Research

Research continues on optimal constants in tail and moment inequalities and on concentration for dependent sub-Weibull processes (martingales, mixing arrays). Data-driven selection of θ\theta and its impact on robustness of estimators remain partly open. Extensions to handle non-i.i.d., non-isotropic, or composite tails (e.g., COM-negative binomial) are under active investigation (Zhang et al., 2021).

The sub-Weibull class forms a natural one-parameter generalization for expressing and controlling stretched-exponential tail behavior in empirical processes, reinforcing its importance in the theoretical and applied statistics literature (Vladimirova et al., 2019, Bong et al., 2023, He, 19 Dec 2025, Zhang et al., 2021).

Topic to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Sub-Weibull Distributions.