Bipartite matching loss is a cost functional for optimal pairing of two independently sampled point clouds, defined by minimizing the average squared Euclidean displacement.
The method employs a linearized electrostatic analogy where the optimal displacement field, given as the gradient of a potential, is computed using Green's functions.
The framework quantifies finite-size effects and spatial correlations, revealing asymptotic scaling laws and logarithmic corrections across different dimensions.
The quadratic stochastic Euclidean bipartite matching loss is a cost functional arising in the optimal assignment between two independently sampled point clouds over a domain Ω⊂Rd. When each point cloud consists of N points sampled according to a density ρ(x) and N≫1, the objective is to determine a permutation mapping that minimizes the average squared Euclidean distance between matched pairs. This framework is closely related to stochastic optimal transport and extends Monge–Kantorovich theory by capturing finite-N fluctuations through a linearized, electrostatic analogy—with explicit formulae for the expected optimal cost and the two-point correlation function of the matching field (Caracciolo et al., 2015).
1. Mathematical Formulation and Problem Structure
Consider a bounded domain Ω⊂Rd, or its d-dimensional flat torus version Td=[0,1]d with periodic boundary conditions. Two independent point sets, R={ri}i=1N and B={bj}j=1N, are sampled i.i.d. from the density ρ(x)>0 on Ω. The assignment σ∗∈SN solves: CN∗=σ∈SNminN1i=1∑N∥ri−bσ(i)∥2.
The loss quantifies the minimal average squared displacement required to match sources to targets.
2. Empirical Measures and Continuum Limit
Define empirical measures: ρR(x)=N1i=1∑Nδ(x−ri),ρB(x)=N1j=1∑Nδ(x−bj),
and their difference ϱ(x)=ρR(x)−ρB(x). As N→∞, both ρR and ρB converge weakly to ρ(x). This continuum perspective enables the reformulation of the matching loss using fields and measures, facilitating analytic derivations.
3. Electrostatic Analogy and Linearization
The optimal displacement field m(x) transporting ρR to ρB is (in the weak sense) the gradient of a potential: m(x)=∇ϕ(x). The push-forward constraint linearizes, yielding the PDE: ∇⋅[ρ(x)∇ϕ(x)]=ϱ(x),∇ϕ⋅n∣∂Ω=0(Neumann BC),
or periodic BC on Td. Introducing the Green's function Gρ(x,y) solving: ∇x⋅[ρ(x)∇xGρ(x,y)]=δ(x−y)−∣Ω∣1,∂nx∂Gρ(x,y)∣x∈∂Ω=0,
the displacement field admits
m(x)=∇ϕ(x)=∫Ω∇xGρ(x,y)ϱ(y)dy.
4. Two-Point Correlation Function
Averaging over all random instantiations of ri,bj, the two-point correlation for the displacement field is
This yields (up to an N-dependent short-distance cutoff): C(x,y)=N2∫Ωρ(z)∇xGρ(x,z)⋅∇yGρ(y,z)dz−N2∬Ω×Ωρ(z)ρ(w)∇xGρ(x,z)⋅∇yGρ(y,w)dzdw.
5. Continuum Formula for Expected Optimal Cost
The average optimal matching loss in the continuum approximation uses the diagonal part of the correlation: CN∗≈∫Ω∥m(x)∥2ρ(x)dx=∫ΩC(x,x)ρ(x)dx,
which leads to the general formula: E[CN∗]≃∫ΩC(x,x)ρ(x)dx=N2∬Ω×Ωρ(x)[ρ(y)Gρ(x,y)−∣Ω∣1Gρ(x,x)]dxdy.
A short-distance cutoff proportional to the typical nearest-neighbor spacing δN∼N−1/d in the integrals is required to regularize divergences.
6. Flat Hypertorus and Explicit Asymptotics
On Td with Poisson density (ρ≡1), the Green function Gd(x−y) solves: ΔxGd(x−y)=δ(x−y)−1,∫TdGd=0.
The corresponding ansatz: C(x,y)≈−N2Gd(x−y),E[CN∗]≈−N2Gd(0),
yields (mode-sum or zeta-function regularization):
d>2: E[CN∗]∼CdN−2/d as N→∞, with Cd=d(2π)dSd−1ζ(1+2d)2.
7. Generalizations and Theoretical Context
For nonuniform density ρ(x) and general domains, correlation functions and expected loss follow the described Green-function prescription. Mean-field scaling E[CN∗]∼N−2/d (with logarithmic correction for d=2) is recovered. This approach extends Monge–Kantorovich theory to stochastic settings by modeling finite-N effects via a weakly linearized electrostatic analogy, capturing random fluctuations in discrete matching problems (Caracciolo et al., 2015).
A plausible implication is that this framework facilitates the systematic study of stochastic transport costs beyond mean-field, including spatial correlation structures and finite-size scaling in bipartite matching problems, with rigorous connection to probabilistic transport theory.