Papers
Topics
Authors
Recent
Search
2000 character limit reached

Quadratic Bipartite Matching Loss

Updated 15 December 2025
  • Bipartite matching loss is a cost functional for optimal pairing of two independently sampled point clouds, defined by minimizing the average squared Euclidean displacement.
  • The method employs a linearized electrostatic analogy where the optimal displacement field, given as the gradient of a potential, is computed using Green's functions.
  • The framework quantifies finite-size effects and spatial correlations, revealing asymptotic scaling laws and logarithmic corrections across different dimensions.

The quadratic stochastic Euclidean bipartite matching loss is a cost functional arising in the optimal assignment between two independently sampled point clouds over a domain ΩRd\Omega \subset \mathbb{R}^d. When each point cloud consists of NN points sampled according to a density ρ(x)\rho(x) and N1N \gg 1, the objective is to determine a permutation mapping that minimizes the average squared Euclidean distance between matched pairs. This framework is closely related to stochastic optimal transport and extends Monge–Kantorovich theory by capturing finite-NN fluctuations through a linearized, electrostatic analogy—with explicit formulae for the expected optimal cost and the two-point correlation function of the matching field (Caracciolo et al., 2015).

1. Mathematical Formulation and Problem Structure

Consider a bounded domain ΩRd\Omega\subset\mathbb{R}^d, or its d-dimensional flat torus version Td=[0,1]d\mathsf{T}^d = [0,1]^d with periodic boundary conditions. Two independent point sets, R={ri}i=1NR = \{r_i\}_{i=1}^N and B={bj}j=1NB = \{b_j\}_{j=1}^N, are sampled i.i.d. from the density ρ(x)>0\rho(x) > 0 on Ω\Omega. The assignment σSN\sigma^* \in S_N solves: CN=minσSN1Ni=1Nribσ(i)2.C_N^* = \min_{\sigma \in S_N} \frac{1}{N} \sum_{i=1}^N \| r_i - b_{\sigma(i)} \|^2. The loss quantifies the minimal average squared displacement required to match sources to targets.

2. Empirical Measures and Continuum Limit

Define empirical measures: ρR(x)=1Ni=1Nδ(xri),ρB(x)=1Nj=1Nδ(xbj),\rho_R(x) = \frac{1}{N} \sum_{i=1}^N \delta(x - r_i), \qquad \rho_B(x) = \frac{1}{N} \sum_{j=1}^N \delta(x - b_j), and their difference ϱ(x)=ρR(x)ρB(x)\varrho(x) = \rho_R(x) - \rho_B(x). As NN \to \infty, both ρR\rho_R and ρB\rho_B converge weakly to ρ(x)\rho(x). This continuum perspective enables the reformulation of the matching loss using fields and measures, facilitating analytic derivations.

3. Electrostatic Analogy and Linearization

The optimal displacement field m(x)m(x) transporting ρR\rho_R to ρB\rho_B is (in the weak sense) the gradient of a potential: m(x)=ϕ(x)m(x) = \nabla \phi(x). The push-forward constraint linearizes, yielding the PDE: [ρ(x)ϕ(x)]=ϱ(x),ϕnΩ=0(Neumann BC),\nabla \cdot [\rho(x) \nabla \phi(x)] = \varrho(x), \qquad \nabla \phi \cdot n|_{\partial\Omega} = 0 \quad (\text{Neumann BC}), or periodic BC on Td\mathsf{T}^d. Introducing the Green's function Gρ(x,y)G_\rho(x, y) solving: x[ρ(x)xGρ(x,y)]=δ(xy)1Ω,nxGρ(x,y)xΩ=0,\nabla_x \cdot [\rho(x)\nabla_x G_\rho(x, y)] = \delta(x - y) - \frac{1}{|\Omega|}, \qquad \frac{\partial}{\partial n_x} G_\rho(x, y)|_{x \in \partial\Omega} = 0, the displacement field admits

m(x)=ϕ(x)=ΩxGρ(x,y)ϱ(y)dy.m(x) = \nabla \phi(x) = \int_\Omega \nabla_x G_\rho(x, y) \,\varrho(y)\,dy.

4. Two-Point Correlation Function

Averaging over all random instantiations of ri,bjr_i, b_j, the two-point correlation for the displacement field is

C(x,y)m(x)m(y)=xGρ(x,z)yGρ(y,w)ϱ(z)ϱ(w)dzdw,C(x, y) \coloneqq \overline{m(x) \cdot m(y)} = \iint \nabla_x G_\rho(x, z)\nabla_y G_\rho(y, w)\overline{\varrho(z)\varrho(w)}\,dz\,dw,

with

ϱ(z)ϱ(w)=2ρ(z)N[δ(zw)ρ(w)].\overline{\varrho(z)\varrho(w)} = 2\frac{\rho(z)}{N} [\delta(z - w) - \rho(w)].

This yields (up to an NN-dependent short-distance cutoff): C(x,y)=2NΩρ(z)xGρ(x,z)yGρ(y,z)dz2NΩ×Ωρ(z)ρ(w)xGρ(x,z)yGρ(y,w)dzdw.C(x, y) = \frac{2}{N} \int_\Omega \rho(z)\, \nabla_x G_\rho(x, z)\cdot\nabla_y G_\rho(y, z)\,dz - \frac{2}{N} \iint_{\Omega\times\Omega} \rho(z)\rho(w)\,\nabla_x G_\rho(x, z)\cdot\nabla_y G_\rho(y, w)\,dz\,dw.

5. Continuum Formula for Expected Optimal Cost

The average optimal matching loss in the continuum approximation uses the diagonal part of the correlation: CNΩm(x)2ρ(x)dx=ΩC(x,x)ρ(x)dx,C_N^* \approx \int_\Omega \| m(x) \|^2 \rho(x)\,dx = \int_\Omega C(x, x)\rho(x)\,dx, which leads to the general formula: E[CN]ΩC(x,x)ρ(x)dx=2NΩ×Ωρ(x)[ρ(y)Gρ(x,y)1ΩGρ(x,x)]dxdy.\boxed{ \mathbb{E}\left[C_N^*\right] \simeq \int_\Omega C(x, x)\rho(x)\,dx = \frac{2}{N} \iint_{\Omega \times \Omega} \rho(x)\left[\rho(y) G_\rho(x, y) - \frac{1}{|\Omega|} G_\rho(x, x)\right]dx\,dy. } A short-distance cutoff proportional to the typical nearest-neighbor spacing δNN1/d\delta_N \sim N^{-1/d} in the integrals is required to regularize divergences.

6. Flat Hypertorus and Explicit Asymptotics

On Td\mathsf{T}^d with Poisson density (ρ1\rho \equiv 1), the Green function Gd(xy)G_d(x-y) solves: ΔxGd(xy)=δ(xy)1,TdGd=0.\Delta_x\, G_d(x - y) = \delta(x - y) - 1, \qquad \int_{\mathsf{T}^d} G_d = 0. The corresponding ansatz: C(x,y)2NGd(xy),E[CN]2NGd(0),C(x, y) \approx -\frac{2}{N} G_d(x-y), \qquad \mathbb{E}[C_N^*] \approx -\frac{2}{N} G_d(0), yields (mode-sum or zeta-function regularization):

  • d=1d = 1: C(x,y)=16xy(1xy)6NC(x, y) = \frac{1 - 6|x - y|(1 - |x - y|)}{6N}, E[CN]=16N+o(1/N)\mathbb{E}[C_N^*] = \frac{1}{6N} + o(1/N),
  • d=2d = 2: E[CN]=lnN2πN+γN+o(1/N)\mathbb{E}[C_N^*] = \frac{\ln N}{2\pi N} + \frac{\gamma}{N} + o(1/N) for constant γ\gamma,
  • d>2d > 2: E[CN]CdN2/d\mathbb{E}[C_N^*] \sim C_d\, N^{-2/d} as NN \to \infty, with Cd=2d(2π)dSd1ζ(1+d2)C_d = \frac{2}{d(2\pi)^d S_{d-1} \zeta\left(1 + \frac{d}{2}\right)}.

7. Generalizations and Theoretical Context

For nonuniform density ρ(x)\rho(x) and general domains, correlation functions and expected loss follow the described Green-function prescription. Mean-field scaling E[CN]N2/d\mathbb{E}[C_N^*] \sim N^{-2/d} (with logarithmic correction for d=2d=2) is recovered. This approach extends Monge–Kantorovich theory to stochastic settings by modeling finite-NN effects via a weakly linearized electrostatic analogy, capturing random fluctuations in discrete matching problems (Caracciolo et al., 2015).

A plausible implication is that this framework facilitates the systematic study of stochastic transport costs beyond mean-field, including spatial correlation structures and finite-size scaling in bipartite matching problems, with rigorous connection to probabilistic transport theory.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Bipartite Matching Loss.