Quantile Isometry & Functional Slicing
- Quantile isometry and functional slicing are mathematical frameworks that embed probability measures into L2 spaces via quantile functions for efficient Wasserstein computations.
- They underpin the double-sliced Wasserstein distance, dramatically reducing computational complexity while ensuring robust metric evaluation in diverse applications.
- By extending classical sliced-Wasserstein approaches with functional projections, these methods enable stable, scalable optimal transport analysis in high-dimensional and meta-measure settings.
Quantile isometry and functional slicing constitute the core mathematical innovations underpinning scalable optimal transport (OT) for probability measures and, in particular, for meta-measures (measures over measures). These concepts connect the structure of 1D Wasserstein spaces to function spaces through the geometry of quantile functions, enabling efficient computation of metrics such as the Wasserstein over Wasserstein (WoW) and its double-sliced surrogates. This article delineates the formal underpinnings, algorithmic realizations, and empirical characteristics of these methods, as developed in the context of the double-sliced Wasserstein (DSW) distance (Piening et al., 26 Sep 2025).
1. Quantile Isometry in One Dimension
The 2-Wasserstein distance on the real line between probability measures —those with finite second moment—admits a canonical formulation: where is the set of all couplings of and . For such measures, the quantile or inverse–CDF function, , assigns to the smallest with . is in .
The critical isometry theorem holds: Thus, the mapping given by is an isometric embedding for . The uniqueness and monotonicity of optimal couplings in one dimension (monotone rearrangement) ensure that this embedding exactly captures transport geometry.
The practical implication is that computation of in 1D reduces to the norm between quantiles, allowing OT on to be computed via quantile sorting—a key enabler for scalable OT algorithms.
2. Functional Slicing in Banach Spaces
Extending to general separable Banach spaces , with the continuous dual, slicing is formalized as follows. For , the linear projection defines the push-forward for any probability measure on . For a probability measure on , the -sliced Wasserstein distance is
If is not contained in a proper subspace of , this defines a metric on . Along each direction , the measure is reduced to a 1D problem computable by quantiles; thus, slicing facilitates scalable, projection-based OT on high- or infinite-dimensional spaces.
This framework generalizes the classical sliced-Wasserstein approach (uniform integration over in ) and provides a template for slicing in functional spaces using probability distributions (e.g., Gaussian processes) over .
3. Double-Sliced Wasserstein Distance for Meta-Measures
Comparing meta-measures (probability measures over measures) necessitates more structure. The double-sliced Wasserstein (DSW) metric operates by:
- Outer (Euclidean) slice: For , the induced 1D meta-measure pushes each atomic measure in to its 1D projection along .
- Inner (functional) slice: For and a 1D meta-measure , the map with pushes via projections against random directions.
The DSW metric is
where is uniform measure on the sphere, and is a probability law (e.g., Gaussian process prior) on . DSW thus combines spatial (Euclidean) projection with functional (quantile space) slicing.
For discrete meta-measures supported on finitely many atomic measures, DSW minimization is equivalent to minimizing the original WoW metric: This demonstrates fidelity of DSW as a surrogate for WoW when applied to practical data representations.
4. Algorithmic Structure and Computational Complexity
The computation of DSW proceeds as follows:
- Sample directions on .
- For each , calculate the induced 1D meta-measures and .
- For each , sample directions in .
- For each , project each 1D atomic measure to the scalar .
- Compute between the resulting empirical distributions via quantile sorting.
- Aggregate via a two-level Monte Carlo average.
The computational cost is , where is the maximum support of the atomic measures. In practice, (with the number of meta-measure atoms), leading to considerable computational savings over the naive and required for direct WoW on discrete meta-measures. The method avoids calculation of high-order moments and high-dimensional LPs, relying solely on quantile evaluation and random projections.
5. Empirical Properties and Practical Significance
Numerical experiments demonstrate:
- For shape classification based on local distance distributions, DSW matches WoW in KNN accuracy while reducing computation time by orders of magnitude on large meshes.
- In OTDD (Optimal Transport Dataset Distance) settings for batches of images, DSW correlates with the ground-truth OTDD (Pearson/Spearman with projections), outperforming moment-based approaches that are unstable for high-order moments.
- In generative point-cloud testing, DSW captures distributional phenomena (mode collapse, sensitivity to noise) similarly to OT-NNA, but with linear rather than cubic or quadratic cost in batch or point resolution.
- For image patch distribution matching, DSW aligns with both Euclidean Wasserstein and KID metrics and maintains robustness under various image transformations and sampling artifacts.
These results establish DSW as a scalable, reliable surrogate for high-dimensional and meta-measure OT, without requiring parametric assumptions or higher-order statistics. The adoption of quantile-based isometry ensures numerical stability even with limited projection samples and irregular supports.
6. Connections to Related Methods and Theoretical Implications
Quantile isometry and functional slicing generalize classical sliced-Wasserstein techniques (Bonnotte et al. 2015) and intersect with recent measures for datasets and distributions, including s-OTDD (Nguyen et al. 2025) and point-cloud OT (Piening & Beinert 2025). Functional slicing (integration over infinite-dimensional directions) parallels developments in random projections on Hilbert spaces (Han 2023).
A plausible implication is that the isometric embedding of probability measures into quantile space enables further dimensionality reduction and kernelization strategies for OT beyond Euclidean settings. The restriction to (quadratic cost) is crucial, as only for does the quantile isometry hold; analogous constructions for would not generally preserve metric structure in .
Furthermore, the avoidance of high-order moments circumvents numerical instability endemic to earlier sliced meta-OT methods—particularly for empirical measures with heavy tails or irregular support—while still providing separation and stability guarantees for discrete cases.
Table: Sliced Wasserstein vs. Double-Sliced (Functional) Wasserstein
| Metric | Slicing Domain | Direction Distribution |
|---|---|---|
| Sliced Wasserstein | in | Uniform (Haar) |
| DSW | Sphere GP on |
This tabulation emphasizes the concatenated, two-level projection structure distinguishing DSW from traditional methods.
Overall, quantile isometry and functional slicing establish a theoretically sound and computationally efficient framework for optimal transport comparisons of both data and meta-data distributions, with direct impact on scalable learning and distributional geometry in high dimensions (Piening et al., 26 Sep 2025).