Papers
Topics
Authors
Recent
Search
2000 character limit reached

Adapted Bures–Wasserstein Space

Updated 7 February 2026
  • Adapted Bures–Wasserstein space is an extension of classical Bures–Wasserstein geometry that incorporates causality constraints in multistage Gaussian processes.
  • It defines an explicit metric by minimizing the Frobenius norm over block-lower-triangular orbits, yielding clear geodesics and well-structured tangent spaces.
  • The non-negative Alexandrov curvature and explicit formulation facilitate robust sequential inference, filtering, and optimal transport applications in high-dimensional temporal models.

The adapted Bures–Wasserstein (ABW) space is a recent extension of the classical Bures–Wasserstein geometry, incorporating causality or adaptedness constraints that are fundamental in multistage stochastic systems, especially discrete-time Gaussian processes. ABW space provides a natural setting for optimal transport with temporal structure, enabling explicit distances, geodesics, and tangent spaces, and exhibiting Alexandrov non-negative curvature. At its core, the ABW space organizes Gaussian process laws as orbits of block-lower-triangular (filtration-respecting) linear mappings, modulo adapted orthogonal transformations, with a metric determined by the minimal Frobenius norm in this orbit class. This construction simultaneously generalizes the classical Bures–Wasserstein geometry of Gaussian measures/covariances and allows for rigorous geometric analysis in multistage and adapted optimal transport (Acciaio et al., 31 Jan 2026, Gunasingam et al., 2024).

1. Formal Definition and Structure

The ABW space arises from endowing the space of multivariate, discrete-time Gaussian processes with the adapted Wasserstein distance. Given a fixed time horizon T1T \ge 1 and state dimension dd, a centered Gaussian process X=(X1,,XT)X = (X_1, \dots, X_T) is represented by

Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}

where G1,,GTG_1, \dots, G_T are independent N(0,Id)N(0, I_d) random vectors, and LL is a block-lower-triangular matrix of size dT×dTdT \times dT. Two processes, specified by (a,L)(a, L) and (b,M)(b, M), are equivalent if their laws coincide under adapted (block-diagonal) orthogonal transformations.

The adapted Bures–Wasserstein distance dd0 between orbits dd1 and dd2 is given by

dd3

where dd4 is the group of block-diagonal orthogonal matrices, and dd5 denotes the Frobenius norm (Acciaio et al., 31 Jan 2026). This metric is the minimizer of a Procrustes-type problem and is well-defined on the quotient space of all such orbits.

When dd6 (single-stage), the ABW space reduces to the original Bures–Wasserstein geometry for positive definite matrices; that is, dd7 coincides with the classical Bures–Wasserstein distance.

2. Metric Geometry and Alexandrov Curvature

The ABW space dd8 is an Alexandrov space with non-negative curvature. The fundamental semi-concavity (triangle comparison) inequality holds for any constant-speed geodesic dd9 and point X=(X1,,XT)X = (X_1, \dots, X_T)0 in the space: X=(X1,,XT)X = (X_1, \dots, X_T)1 This property is inherited from the geometry of X=(X1,,XT)X = (X_1, \dots, X_T)2 spaces under X=(X1,,XT)X = (X_1, \dots, X_T)3-interpolations and persists under the restriction to (regular) Gaussian processes and their orbits. The resulting space is geodesically convex: the set of all nondegenerate (regular) Gaussian processes forms a closed convex subset, with well-behaved tangent cones and exponential maps (Acciaio et al., 31 Jan 2026). The non-negative curvature facilitates contraction properties and stability for optimization and sampling algorithms in the space.

3. Geodesics, Tangent Cones, and Exponential Maps

In ABW space, geodesics between points X=(X1,,XT)X = (X_1, \dots, X_T)4 and X=(X1,,XT)X = (X_1, \dots, X_T)5 are explicit and correspond to linearly interpolated block-lower-triangular matrices, up to adaptation via minimizing block-diagonal orthogonals. Specifically, for some X=(X1,,XT)X = (X_1, \dots, X_T)6 achieving the infimum,

X=(X1,,XT)X = (X_1, \dots, X_T)7

Every geodesic arises from such an interpolation, and non-uniqueness emerges only when degeneracies in the conditional covariances are present (Acciaio et al., 31 Jan 2026).

The tangent cone at X=(X1,,XT)X = (X_1, \dots, X_T)8 is given by the quotient X=(X1,,XT)X = (X_1, \dots, X_T)9, where

Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}0

with an explicit identification of the pseudo-metric. In the nondegenerate (regular) case, Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}1, and the tangent cone becomes a Hilbert space with inner product Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}2.

The exponential map is linear: for Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}3 sufficiently small,

Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}4

The logarithm is single-valued precisely when there is a unique geodesic, i.e., when the ranks of diagonal blocks match (Acciaio et al., 31 Jan 2026).

For two finite-dimensional Gaussian laws Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}5 and Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}6 on Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}7, the ABW distance specializes to (Gunasingam et al., 2024): Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}8 where Xt=s=1tLt,sGs,L=(Lt,s)1stTX_t = \sum_{s=1}^t L_{t,s}\,G_s,\quad L = (L_{t,s})_{1 \le s \le t \le T}9, G1,,GTG_1, \dots, G_T0 are the Cholesky factors, and the infimum is over all bicausal couplings—i.e., couplings respecting the temporal filtration. The optimal bicausal coupling is characterized by block-diagonally signed permutations applied to G1,,GTG_1, \dots, G_T1 such that each diagonal term G1,,GTG_1, \dots, G_T2 is replaced by its modulus.

This construction is fundamentally different from the classical Bures–Wasserstein structure, as it fails to coincide with any smooth Riemannian metric and is generally incomplete for processes of rank at most two (Acciaio et al., 31 Jan 2026, Gunasingam et al., 2024).

The ABW space recovers the classical Bures–Wasserstein metric when G1,,GTG_1, \dots, G_T3, wherein

G1,,GTG_1, \dots, G_T4

and the geodesic, tangent, and metric structures coincide with the well-established Riemannian geometry of positive-definite matrices (Oostrum, 2020, Thanwerdas et al., 2022).

In the adapted case, the strict filtration-respecting requirement modifies both the minimal transport cost and geodesic structure, making the adapted distance strictly greater than the classical Bures–Wasserstein distance whenever causality constraints are nontrivial. The lack of a global manifold structure and the presence of non-uniqueness in geodesics for degenerate multi-stage processes further underscores this distinction (Acciaio et al., 31 Jan 2026, Gunasingam et al., 2024).

6. Applications and Implications

The ABW space is central in problems of adapted optimal transport, sequential inference, and filtering of Gaussian processes:

  • It provides a rigorous metric for distances between stochastic processes with adapted structures (e.g., filtering, smoothing, and control of time-series).
  • Its non-negative Alexandrov curvature ensures stability properties useful for iterative algorithms, such as barycenter computation and optimal transport interpolation.
  • The special role of the "regular" subspace ensures that, for nondegenerate processes, both geodesics and tangent cones are globally well-posed and tractable.
  • Explicit metrization via the block-lower-triangular Procrustes problem enables algorithmic implementations exploiting matrix factorizations and singular value decompositions (Acciaio et al., 31 Jan 2026).

A plausible implication is that, in high-dimensional temporal models (including control and signal-processing applications), the ABW space may enable both computationally efficient and statistically robust transport-based algorithms, extending the reach of optimal transport and information geometry into stochastic process settings.

7. Connections to Quantum Information and Matrix-Valued Transport

At G1,,GTG_1, \dots, G_T5, the ABW construction is equivalent to the Bures metric relevant in quantum information, connecting it directly to quantum fidelity and the geometry of quantum density matrices (Kroshnin et al., 2019, Oostrum, 2020). Extensions to matrix-valued (e.g., operator- or tensor-valued) transport with generalized Benamou–Brenier dynamics further underline the versatility of the underlying geometric architecture (Li et al., 2020). The regularized and dynamically weighted analogs in the ABW framework provide a path toward unified theories of unbalanced, adapted, and quantum transport geometries.


Key references: (Acciaio et al., 31 Jan 2026, Gunasingam et al., 2024, Thanwerdas et al., 2022, Oostrum, 2020, Li et al., 2020, Kroshnin et al., 2019).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Adapted Bures--Wasserstein Space.