Normalized L1 Distance: Scale-Invariant Metric
- Normalized L1 distance is a scale-invariant measure defined as the expected absolute difference normalized by the sum of absolute first moments, ensuring values between 0 and 1.
- It provides closed-form expressions for standard distributions and connects to established indices, such as the Gini index and 1-Wasserstein metric.
- Under specific independence and nonnegativity conditions, it satisfies key metric properties, thereby robustly quantifying statistical discrepancies in diverse applications.
The normalized $L^1$-distance, denoted $d_N(X, Y)$, is a probabilistic metric between real-valued integrable random variables $X$ and $Y$, widely studied for its applications in theoretical and applied fields such as economics and physics. This distance is defined as the expected absolute difference between $X$ and $Y$, normalized by the sum of their absolute first moments. Structured to always lie between 0 and 1, it refines the traditional $L^1$-distance by providing a scale-invariant measure, particularly significant when comparing distributions of differing magnitudes. The normalized $L^1$-distance encapsulates and unifies several well-established concepts, including the Gini index and the Łukaszyk–Karmowski metric, and emerges as a special instance within the framework of 1-Wasserstein optimal transport (Rolle, 2021).
1. Formal Definition and Properties
Let $(\Omega, \mathcal{F}, P)$ be a probability space with $X, Y \in L^1(\Omega, \mathcal{F}, P)$, i.e., both are integrable real-valued random variables. The (compound) $L^1$-distance is
$$d(X, Y) = \mathbb{E}|X - Y|.$$
The normalized $L^1$-distance, defined when $\mathbb{E}|X| + \mathbb{E}|Y| > 0$, is
$$d_N(X, Y) = \frac{\mathbb{E}|X - Y|}{\mathbb{E}|X| + \mathbb{E}|Y|},$$
and $d_N(X, Y) := 0$ when both expectations vanish. This yields $0 \le d_N(X, Y) \le 1$ for all such $X, Y$.
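As a quick illustration, the definition can be estimated from paired samples; the following Monte Carlo sketch (the function name `d_N` and the sampling setup are illustrative, not from the source) makes the $[0, 1]$ range concrete:

```python
import random

def d_N(xs, ys):
    """Estimate the normalized L1 distance from paired samples:
    E|X - Y| / (E|X| + E|Y|), with the convention d_N = 0 when
    both absolute first moments vanish."""
    n = len(xs)
    num = sum(abs(x - y) for x, y in zip(xs, ys)) / n
    den = sum(abs(x) for x in xs) / n + sum(abs(y) for y in ys) / n
    return 0.0 if den == 0 else num / den

random.seed(0)
xs = [random.gauss(0, 1) for _ in range(100_000)]
ys = [random.gauss(0, 1) for _ in range(100_000)]
# True: since |x - y| <= |x| + |y| per sample, the estimate lies in [0, 1]
print(0.0 <= d_N(xs, ys) <= 1.0)
```

The estimator inherits scale invariance directly: rescaling both samples by a common positive factor leaves the ratio unchanged.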
Analyzing $d_N$ through the axioms of metric spaces:
- Non-negativity: $d_N(X, Y) \ge 0$.
- Symmetry: $d_N(X, Y) = d_N(Y, X)$.
- Reflexivity: $d_N(X, X) = 0$.
- Identity of indiscernibles: $d_N(X, Y) = 0$ if and only if $X = Y$ almost surely, considering the standard identification of random variables up to almost sure equality.
In general, $d_N$ does not satisfy the triangle inequality. However, under the condition that $X$, $Y$, $Z$ are mutually independent, integrable, and nonnegative (with at most one of them concentrated at zero), Rolle proves that $d_N$ satisfies the triangle inequality
$$d_N(X, Z) \le d_N(X, Y) + d_N(Y, Z).$$
This is achieved via a specific algebraic inequality involving the individual $L^1$-distances and first moments, leveraging what is termed a "Canberra inequality,"
$$\frac{|x - z|}{|x| + |z|} \le \frac{|x - y|}{|x| + |y|} + \frac{|y - z|}{|y| + |z|},$$
valid for all real $x, y, z$ (Rolle, 2021).
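The scalar inequality underpinning the proof can be probed numerically; a brute-force sketch over a small grid of reals (the helper name `canberra` is illustrative, with the usual $0/0 := 0$ convention):

```python
from itertools import product

def canberra(x, y):
    """One-dimensional Canberra distance |x - y| / (|x| + |y|), with 0/0 := 0."""
    den = abs(x) + abs(y)
    return 0.0 if den == 0 else abs(x - y) / den

# Brute-force check of the triangle inequality on a grid of reals,
# with a tiny tolerance for floating-point rounding.
grid = [i / 2 for i in range(-8, 9)]
ok = all(
    canberra(x, z) <= canberra(x, y) + canberra(y, z) + 1e-12
    for x, y, z in product(grid, repeat=3)
)
print(ok)  # True on this grid
```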
2. Closed-form Expressions for Standard Distributions
Explicit evaluation of $d_N$ is important in statistics and applied modeling. In the case of two independent Gaussians $X \sim N(\mu_1, \sigma_1^2)$ and $Y \sim N(\mu_2, \sigma_2^2)$, the difference satisfies $X - Y \sim N(\mu, \sigma^2)$ with $\mu = \mu_1 - \mu_2$ and $\sigma^2 = \sigma_1^2 + \sigma_2^2$, and the expected absolute difference reads
$$\mathbb{E}|X - Y| = \mu\bigl(2\Phi(\mu/\sigma) - 1\bigr) + 2\sigma\,\varphi(\mu/\sigma),$$
where $\Phi$ and $\varphi$ denote the cdf and pdf of the standard normal, respectively. The one-marginal expectation is
$$\mathbb{E}|X| = \mu_1\bigl(2\Phi(\mu_1/\sigma_1) - 1\bigr) + 2\sigma_1\,\varphi(\mu_1/\sigma_1);$$
$d_N(X, Y)$ is then computed by substituting these closed forms.
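The Gaussian closed forms are easy to code with only the standard library (`math.erf` gives $\Phi$); the following sketch uses illustrative function names:

```python
import math

def phi(t):
    """Standard normal pdf."""
    return math.exp(-t * t / 2) / math.sqrt(2 * math.pi)

def Phi(t):
    """Standard normal cdf, via the error function."""
    return 0.5 * (1 + math.erf(t / math.sqrt(2)))

def mean_abs_normal(mu, sigma):
    """E|Z| for Z ~ N(mu, sigma^2) (folded-normal mean)."""
    if sigma == 0:
        return abs(mu)
    return mu * (2 * Phi(mu / sigma) - 1) + 2 * sigma * phi(mu / sigma)

def d_N_gaussian(mu1, s1, mu2, s2):
    """Closed-form normalized L1 distance for independent Gaussians."""
    num = mean_abs_normal(mu1 - mu2, math.sqrt(s1 ** 2 + s2 ** 2))
    den = mean_abs_normal(mu1, s1) + mean_abs_normal(mu2, s2)
    return num / den

# Two independent standard normals: the ratio is 1/sqrt(2)
print(round(d_N_gaussian(0.0, 1.0, 0.0, 1.0), 4))  # 0.7071
```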
For independent uniform variables $X \sim U[a, b]$, $Y \sim U[c, d]$, the mean absolute difference is determined through an explicit double integration,
$$\mathbb{E}|X - Y| = \frac{1}{(b - a)(d - c)} \int_a^b \int_c^d |x - y| \, dy \, dx,$$
with polynomials in the endpoints providing concrete values in the cases of interval separation, inclusion, or general overlap. For pure separation ($b \le c$), $\mathbb{E}|X - Y| = m_Y - m_X$, where $m_X = (a + b)/2$ and $m_Y = (c + d)/2$ are the midpoints of the respective intervals. Table summaries of case enumeration and formulas are presented in (Rolle, 2021).
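For the pure-separation case, the midpoint formula can be cross-checked by simulation (a sketch; the helper name and sample size are illustrative):

```python
import random

def mean_abs_diff_uniform_mc(a, b, c, d, n=200_000, seed=1):
    """Monte Carlo estimate of E|X - Y| for independent X ~ U[a,b], Y ~ U[c,d]."""
    rng = random.Random(seed)
    return sum(abs(rng.uniform(a, b) - rng.uniform(c, d)) for _ in range(n)) / n

# Pure separation (b <= c): E|X - Y| equals the distance between midpoints.
a, b, c, d = 0.0, 1.0, 2.0, 4.0
exact = (c + d) / 2 - (a + b) / 2  # m_Y - m_X = 2.5
est = mean_abs_diff_uniform_mc(a, b, c, d)
print(abs(est - exact) < 0.01)  # True: estimate agrees within MC error
```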
3. Domains of Application and Illustrative Behavior
Normalized $L^1$-distance is prevalent in fields where scale invariance and robust discrepancy measures are essential. In economics, it appears as the Gini index (see §4). In physics, especially error analysis, $\mathbb{E}|X - Y|$ is known as the Łukaszyk–Karmowski metric.
Figures in (Rolle, 2021) exemplify behavior in the bivariate normal setup: as the correlation $\rho$ approaches 1, the joint distribution concentrates on the diagonal and $d_N \to 0$ (total dependence implies null normalized distance). For uniform distributions, $d_N$ interpolates from 0 (total overlap) to 1 (one variable identically zero and the other nondegenerate), with critical dependence on support overlap.
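The $\rho \to 1$ behavior can be reproduced by direct simulation of a standard bivariate normal pair (a sketch; the estimator name and its defaults are illustrative):

```python
import random

def d_N_bivariate_normal_mc(rho, n=100_000, seed=2):
    """Monte Carlo d_N for a standard bivariate normal pair with correlation rho,
    built from two independent standard normals z1, z2."""
    rng = random.Random(seed)
    num = den = 0.0
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        x = z1
        y = rho * z1 + (1 - rho * rho) ** 0.5 * z2  # marginal is again N(0, 1)
        num += abs(x - y)
        den += abs(x) + abs(y)
    return num / den

# d_N shrinks toward 0 as rho -> 1 (mass concentrates on the diagonal)
for rho in (0.0, 0.9, 0.99):
    print(rho, round(d_N_bivariate_normal_mc(rho), 3))
```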
4. Connections to Classical Indices and Distances
The normalized $L^1$-distance not only unifies disparate applications but also recovers several established quantities:
- Gini index: For a nonnegative distribution $X$ with $\mathbb{E}[X] > 0$, the Gini mean difference is $\Delta = \mathbb{E}|X_1 - X_2|$, where $X_1, X_2$ are independent copies of $X$. The Gini index is its normalized analogue:
$$G(X) = \frac{\mathbb{E}|X_1 - X_2|}{2\,\mathbb{E}[X]} = d_N(X_1, X_2).$$
Thus, the Gini index is $d_N$ viewed as the "autodistance" of a distribution, evaluated between two independent copies of $X$.
- Łukaszyk–Karmowski metric: $d(X, Y) = \mathbb{E}|X - Y|$, introduced in physics for uncertainty quantification, possesses reflexivity ($d(X, X) = \mathbb{E}|X - X| = 0$ when a variable is compared with itself rather than with an independent copy), contrary to early misconceptions.
- Optimal transport (1-Wasserstein): If $\mu, \nu$ are probability laws on $\mathbb{R}$, the Monge–Kantorovich problem with cost $c(x, y) = |x - y|$ leads to the 1-Wasserstein distance
$$W_1(\mu, \nu) = \int_{\mathbb{R}} |F_\mu(x) - F_\nu(x)| \, dx,$$
where $F_\mu, F_\nu$ are the cdfs of $\mu, \nu$. For independent $X \sim \mu$ and $Y \sim \nu$, $\mathbb{E}|X - Y|$ is the cost under the trivial product coupling, so $W_1(\mu, \nu) \le \mathbb{E}|X - Y|$.
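Both connections above can be checked on small empirical samples; in the sketch below (function names illustrative), `gini_index` realizes the "autodistance" formula, and the sorted-matching computation of $W_1$ is compared against the product-coupling cost:

```python
def gini_index(xs):
    """Gini index of a nonnegative sample: mean |X1 - X2| over all
    ordered pairs, divided by twice the sample mean -- the d_N
    "autodistance" of the empirical distribution."""
    n = len(xs)
    mean = sum(xs) / n
    mad = sum(abs(x1 - x2) for x1 in xs for x2 in xs) / (n * n)
    return mad / (2 * mean)

def w1_empirical(xs, ys):
    """1-Wasserstein distance between equal-size empirical laws with
    cost |x - y|: in one dimension the optimal coupling matches
    sorted samples."""
    return sum(abs(x - y) for x, y in zip(sorted(xs), sorted(ys))) / len(xs)

def product_coupling_cost(xs, ys):
    """E|X - Y| under the independent (product) coupling of the two
    empirical laws: the average over all ordered pairs."""
    return sum(abs(x - y) for x in xs for y in ys) / (len(xs) * len(ys))

print(gini_index([1.0, 1.0, 1.0, 1.0]))  # 0.0: perfect equality
xs, ys = [0.0, 1.0, 2.0], [0.5, 1.5, 2.5]
print(w1_empirical(xs, ys))              # 0.5: monotone matching is optimal
print(product_coupling_cost(xs, ys) >= w1_empirical(xs, ys))  # True
```

The last comparison illustrates that the product coupling is merely one feasible transport plan, so its cost dominates $W_1$.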
5. Mathematical and Probabilistic Structure
The normalized $L^1$-distance defines a semimetric on the space of integrable random variables, becoming a full metric when restricted to independent, nonnegative variables, as established through the generalized triangle inequality. The proof involves verifying a nontrivial algebraic condition, ultimately relying on the validity of the Canberra inequality
$$\frac{|x - z|}{|x| + |z|} \le \frac{|x - y|}{|x| + |y|} + \frac{|y - z|}{|y| + |z|}$$
for all real $x, y, z$. This semimetric structure allows for flexible deployment across disparate random variable pairs and distributions, provided integrability conditions are met.
6. Illustrative Regimes and Range
$d_N$ assumes values in $[0, 1]$, with limiting cases as follows:
- $d_N(X, Y) = 0$: holds if $X = Y$ almost surely or, for instance, in the degenerate case where both random variables vanish.
- $d_N(X, Y) \to 0$: as the joint law of $(X, Y)$ concentrates on the diagonal (e.g., perfect dependence, high correlation).
- $d_N(X, Y) = 1$: occurs when one variable is almost surely zero while the other is integrable and nondegenerate (Rolle, 2021).
This range captures scenarios of perfect equality, maximal disparity, and interpolation governed by the probabilistic and algebraic relations between the random variables’ distributions.
Normalized $L^1$-distance thus provides a robust, interpretable, and mathematically grounded similarity measure unifying concepts from diverse fields, with rigorous theoretical guarantees and tractable formulae in common applied cases (Rolle, 2021).