Sub-Weibull Distributions

Updated 22 December 2025

Sub-Weibull distributions are a family of probability models defined by stretched-exponential tail decay, generalizing both sub-Gaussian and sub-exponential types.
They enable sharp non-asymptotic moment inequalities and concentration bounds essential for analyzing high-dimensional, heavy-tailed data.
They are applied in robust covariance estimation, regression, and graphical models, with techniques for empirical tail index estimation playing a key role.

Sub-Weibull distributions constitute a parametrized family of probability distributions characterized by tails at least as light as, and potentially heavier than, the exponential decay of Weibull-type laws. They generalize both the sub-Gaussian and sub-exponential classes, interpolating between light-tailed and heavier, but still stretched-exponential, regimes. The framework of sub-Weibull random variables and vectors enables the development of non-asymptotic moment inequalities, sharp concentration results, and robust statistical tools for high-dimensional scenarios with heavy-tailed or non-sub-Gaussian data.

1. Formal Definitions and Characterizations

Sub-Weibull distributions can be defined and characterized via several equivalent formulations encompassing tail behavior, moment growth, Orlicz norms, and moment generating functions. For a real random variable $X$ , and tail index parameter $\theta>0$ , $X$ is called sub-Weibull of order $\theta$ (notation: $X\sim\mathrm{subW}(\theta)$ ) if any of the following (equivalent up to constants) conditions hold (Vladimirova et al., 2019, He, 19 Dec 2025, Zhang et al., 2021):

Tail Bound: $\exists a,b>0$ such that

$\mathbb{P}(|X| \geq x) \leq a\exp(-b x^{1/\theta}) \quad \forall x\geq 0$

Moment Growth: $\exists K>0$ s.t.

$(\mathbb{E}|X|^k)^{1/k} \leq K k^\theta, \quad \forall k\geq 1$

Orlicz Norm (ψ $_\alpha$ ):

$\|X\|_{\psi_\alpha} := \inf \left\{ C>0: \mathbb{E} \exp \left( |X/C|^\alpha \right) \leq 2 \right\} < \infty$

with $\alpha = 1/\theta$ .

MGF-Type (Orlicz) Condition: $\exists K>0$ so that

$\mathbb{E} \exp\left( (|X|/K)^{1/\theta} \right) \leq 2$

The sub-Weibull property is often indexed as $\mathrm{subW}(\theta)$ , or, equivalently, as the class $\mathcal{L}_{\psi_\alpha}$ via the Orlicz norm.

A random vector $X\in \mathbb{R}^d$ is called sub-Weibull( $\alpha$ ) if for all $u\in S^{d-1}$ , $\langle X,u \rangle$ is sub-Weibull of order $\alpha$ .

2. Relation to Sub-Gaussian and Sub-Exponential Regimes

The sub-Weibull hierarchy recovers classical tail behaviors for special values of $\theta$ ( $\alpha$ ), establishing a continuous spectrum of tail-decay and moment growth:

$\theta$ ( $\alpha=1/\theta$ )	Recovery	Tail Behavior	Moment Growth
$\theta=1/2$ ( $\alpha=2$ )	sub-Gaussian	$\exp(-cx^2)$	$(\mathbb{E}\|X\|^k)^{1/k} \precsim \sqrt{k}$
$\theta=1$ ( $\alpha=1$ )	sub-Exponential	$\exp(-cx)$	$(\mathbb{E}\|X\|^k)^{1/k} \precsim k$
$\theta<1$ ( $\alpha>1$ )	heavier-tailed sub-Weibull (stretched exp)	$\exp(-c x^\alpha)$	$(\mathbb{E}\|X\|^k)^{1/k} \sim k^{1/\alpha}$

The sub-Weibull family interpolates: as $\theta$ increases, distributions accommodate heavier tails (but all moments remain finite and grow polynomially in $k$ ; thus, tails remain lighter than power laws) (Vladimirova et al., 2019, He, 19 Dec 2025, Zhang et al., 2021).

3. Concentration Inequalities and Tail Bounds

Non-asymptotic concentration phenomena for sums of (possibly weighted) independent sub-Weibull random variables mirror classical Bernstein or Rosenthal bounds, but must also accommodate the heaviest allowed (stretched-exponential) deviations (Bong et al., 2023, Zhang et al., 2021):

Hoeffding/Bernstein-Type Tail: If $X_1, ..., X_n$ are independent, mean-zero, sub-Weibull( $\theta$ ) with common $\theta$ , then for suitable constant $K_\theta$ ,

$\mathbb{P}\left( \left| \sum_{i=1}^n X_i \right| \geq x \right) \leq \exp \left( - \left( \frac{x}{n K_\theta} \right)^{1/\theta} \right)$

Two-Regime (GBO) Inequality: For sum $S = \sum_{i=1}^n a_i X_i$ , the probability satisfies for all $t\geq 0$

$\mathbb{P}\left(|S| \geq s \right) \leq 2 \exp\left(- \min \left\{ \frac{s^2}{C\|b\|_2^2}, \left( \frac{s}{C\|b\|_{L_n}} \right)^\theta \right\}\right)$

where the quadratic regime dominates for small $s$ and the stretched exponential for large $s$ (Bong et al., 2023, Zhang et al., 2021).

Moment and Tail Equivalences: The following equivalence holds, up to constants:

$\|X\|_{\psi_\alpha} < \infty \quad\Longleftrightarrow\quad \mathbb{P}(|X| > t) \leq 2 \exp(- (t/K)^\alpha), \quad (\mathbb{E}|X|^p)^{1/p} \lesssim K p^{1/\alpha}$

This allows the deployment of uniform high-probability tail bounds and error analysis for sums, projections, or quadratic forms involving heavy-tailed (but sub-Weibull) random variables.

4. Estimation of the Tail Index and Empirical Techniques

For practical data analysis, estimation of the tail parameter $\theta$ (or $\alpha$ ) is essential. For a random variable with a Weibull-type tail, the $t$ -quantile takes the form $q(t) = \lambda [ -\log(1-t) ]^\theta$ , yielding a log-quantile plot linear in $\log(-\log(1-t))$ with slope $\theta$ (Vladimirova et al., 2019):

Linear Regression Estimator: Order $n$ data points as $Y_{n,n}\ge Y_{n-1,n}\ge \ldots \ge Y_{1,n}$ , select the $k$ largest, and regress

$v_i = \log Y_{n-i+1, n}\qquad u_i = \log \log(n/i)$

Then, estimate $\hat{\theta}$ as the slope of linear regression of $v_i$ on $u_i$ .

Moment Estimators: Compute empirical norms such as

$\sup_{1\leq k \leq k_{max}} \left( \frac{1}{n} \sum_{i=1}^n |X_i|^{\theta k} / k! \right)^{1/(\theta k)}$

as proxies for Orlicz- or moment-based sub-Weibull norms (Zhang et al., 2021).

Cross-validation for $\theta$ : If $\theta$ is unknown, tuning via cross-validation enables empirical model selection in successive applications.

5. Closure Properties and Algebraic Operations

The sub-Weibull classes enjoy several algebraic closure and order properties (Vladimirova et al., 2019, Zhang et al., 2021):

Inclusion: For $0<\theta_1\leq \theta_2$ ,

$\mathrm{subW}(\theta_1) \subseteq \mathrm{subW}(\theta_2)$

Heavier-tailed classes properly contain the lighter-tailed ones.

Sum/Addition: If $X\in \mathrm{subW}(\theta_1)$ and $Y\in \mathrm{subW}(\theta_2)$ ,

$X+Y \in \mathrm{subW}(\max\{\theta_1, \theta_2\})$

Product: For $X\in \mathrm{subW}(\theta_1)$ , $Y\in \mathrm{subW}(\theta_2)$ ,

$XY \in \mathrm{subW}(\theta_1 + \theta_2)$

Powers: If $X\sim\mathrm{subW}(\theta)$ , then $|X|^r\sim \mathrm{subW}(\theta/r)$ , and $\| |X|^r\|_{\psi_{\theta/r}} = \|X\|_{\psi_\theta}^r$ (Zhang et al., 2021).
Optimal Tail Index: If $(\mathbb{E}|X|^k)^{1/k} \asymp k^\theta$ as $k\to\infty$ , then $\theta$ is minimal for which $X\in\mathrm{subW}(\theta)$ .

6. Statistical Applications and Examples

The sub-Weibull formalism underpins robust statistical inference for high-dimensional and heavy-tailed data (Zhang et al., 2021, Bong et al., 2023, He, 19 Dec 2025):

Covariance Estimation: For i.i.d. $X_i$ with sub-Weibull( $\alpha$ ) tails, norm-/spectrally-truncated estimators achieve empirical error $\|\hat\Sigma-\Sigma\|_{\mathrm{op}} \lesssim \|\Sigma\|_{\mathrm{op}}\sqrt{r(\Sigma)\log N / N}$ , preserving the sub-Gaussian rate even under significantly heavier marginal distributions (He, 19 Dec 2025).
Negative Binomial Regression: For covariate vectors $X_i$ with $\theta<1$ , the $\ell_2$ -error of maximum-likelihood or Z-estimator coefficients admits sharp non-asymptotic risk bounds, with

$\|\hat\beta_n-\beta^*\|_2 \lesssim \text{poly}(\frac{p,\log(p/\delta)}{n})\cdot \log^{1/\theta}(np/\delta)$

where $p$ is the dimension and $n$ the sample size (Zhang et al., 2021).

Random Matrix Theory: For isotropic matrices with sub-Weibull( $\theta$ ) rows, Bai–Yin-type spectral norm bounds and eigenvalue location properties extend, with deviations controlled by two-regime functions $c[\sqrt{t/n} + (t/n)^{1/\theta}]$ (Zhang et al., 2021).
Graphical Models: Estimation of multiple precision/covariance matrices from high-dimensional data with sub-Weibull marginals attains sample complexity improvements, as the required $n$ scales nearly linearly in the number of models and dimension in contrast to quadratic dependence for truly heavy-tailed errors (Bong et al., 2023).
Bayesian Neural Networks: Deep units composed of Gaussian-weighted layers induce sub-Weibull marginals with tail index directly determined by network depth, with empirical confirmation by slope-of-log-quantile regression (Vladimirova et al., 2019).

7. Extensions, Limitations, and Ongoing Research

Research continues on optimal constants in tail and moment inequalities and on concentration for dependent sub-Weibull processes (martingales, mixing arrays). Data-driven selection of $\theta$ and its impact on robustness of estimators remain partly open. Extensions to handle non-i.i.d., non-isotropic, or composite tails (e.g., COM-negative binomial) are under active investigation (Zhang et al., 2021).

The sub-Weibull class forms a natural one-parameter generalization for expressing and controlling stretched-exponential tail behavior in empirical processes, reinforcing its importance in the theoretical and applied statistics literature (Vladimirova et al., 2019, Bong et al., 2023, He, 19 Dec 2025, Zhang et al., 2021).

Markdown Report Issue Upgrade to Chat

References (4)

Sub-Weibull distributions: generalizing sub-Gaussian and sub-Exponential properties to heavier-tailed distributions (2019)

Fast and Robust: Computationally Efficient Covariance Estimation for Sub-Weibull Vectors (2025)

Sharper Sub-Weibull Concentrations (2021)

Tight Concentration Inequality for Sub-Weibull Random Variables with Generalized Bernstien Orlicz norm (2023)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Sub-Weibull Distributions.

Sub-Weibull Distributions

1. Formal Definitions and Characterizations

2. Relation to Sub-Gaussian and Sub-Exponential Regimes

3. Concentration Inequalities and Tail Bounds

4. Estimation of the Tail Index and Empirical Techniques

5. Closure Properties and Algebraic Operations

6. Statistical Applications and Examples

7. Extensions, Limitations, and Ongoing Research

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Don't miss out on important new AI/ML research

Sub-Weibull Distributions

1. Formal Definitions and Characterizations

2. Relation to Sub-Gaussian and Sub-Exponential Regimes

3. Concentration Inequalities and Tail Bounds

4. Estimation of the Tail Index and Empirical Techniques

5. Closure Properties and Algebraic Operations

6. Statistical Applications and Examples

7. Extensions, Limitations, and Ongoing Research

Topic to Video (Beta)

Whiteboard

Follow Topic

Continue Learning

Related Topics

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research