- The paper establishes sharp local sparsity rates for conditionals of regularized optimal transport, with the support shrinking at the rate ε^(1/[d(p-1)+2]).
- The authors employ convex-analytic and PDE-based techniques, leveraging Legendre duality and interior regularity to prove uniform strong convexity and convergence of ROT potentials.
- The findings offer actionable insights for improving algorithmic stability and error analysis in high-dimensional regimes where regularized OT is applied.
Sharp Local Sparsity of Regularized Optimal Transport
Problem Setting and Motivation
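This paper analyzes sharp local sparsity properties of regularized optimal transport (ROT) problems with $L^p$-type entropy regularization for $p \in (1,2]$. The problem considered is
$$\mathrm{ROT}_{\varepsilon,p} := \inf_{\pi \in \Pi(\lambda,\mu),\ \pi \ll \lambda \otimes \mu} \int \frac{1}{2}\|x-y\|^2 \, d\pi + \varepsilon \int h_p\!\left(\frac{d\pi}{d(\lambda \otimes \mu)}\right) d(\lambda \otimes \mu),$$
where $h_p(z) = \frac{|z|^p - 1}{p-1}$, $\lambda$ and $\mu$ are probability measures with smooth densities on compact supports in $\mathbb{R}^d$, and $\Pi(\lambda,\mu)$ denotes the set of couplings matching both marginals. $L^p$-type regularization includes quadratic regularization ($p = 2$) and interpolates between classical OT and entropic OT.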
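To make the objective concrete, here is a minimal sketch of its discrete analogue in one dimension, assuming finitely supported marginals given as weight vectors. The function names `h_p` and `rot_objective` are illustrative, not from the paper. The code only evaluates the objective for a given coupling; it does not solve the minimization.

```python
import numpy as np

def h_p(z, p):
    """L^p-type entropy h_p(z) = (|z|^p - 1) / (p - 1), for p in (1, 2]."""
    return (np.abs(z) ** p - 1.0) / (p - 1.0)

def rot_objective(x, y, lam, mu, pi, eps, p):
    """Discrete analogue of the ROT objective for a coupling matrix `pi`.

    x, y: 1D support points; lam, mu: strictly positive marginal weights;
    pi: coupling matrix assumed to have marginals lam and mu.
    """
    cost = 0.5 * (x[:, None] - y[None, :]) ** 2   # (1/2)|x - y|^2
    ref = np.outer(lam, mu)                       # product measure lam (x) mu
    density = pi / ref                            # d pi / d(lam (x) mu)
    transport = np.sum(cost * pi)                 # transport cost term
    penalty = np.sum(h_p(density, p) * ref)       # regularization term
    return transport + eps * penalty

x = np.linspace(0.0, 1.0, 5)
y = np.linspace(0.0, 1.0, 5)
lam = np.full(5, 0.2)
mu = np.full(5, 0.2)

independent = np.outer(lam, mu)   # density 1 everywhere, so zero penalty
diagonal = np.diag(lam)           # identity coupling, so zero transport cost

obj_independent = rot_objective(x, y, lam, mu, independent, eps=0.1, p=2.0)
obj_diagonal = rot_objective(x, y, lam, mu, diagonal, eps=0.1, p=2.0)
print(obj_independent, obj_diagonal)
```

The two couplings show the trade-off: the independent coupling pays only transport cost because $h_p(1) = 0$, while the diagonal coupling pays only the regularization penalty.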
Unlike the entropic (EOT) case, where optimal plans have full support, ROT plans are inherently sparse: for small ε, the support of the ROT plan contracts toward the support of the OT plan. This sparsification is a fundamental feature exploited in various computational and statistical applications, as it enables reduced sample complexity and tractable computations with high-dimensional data.
One of the main open questions is to determine, locally and globally, the sharp rates at which this contraction of the support occurs as ε → 0, especially in the interior of the support, and to establish corresponding quantitative regularity properties for the associated convex dual potentials.
Sharp Local Sparsity Rates
The core result provides precise asymptotics for the contraction of the support of regularized optimal couplings, in particular for its fibers, that is, the conditional supports over a fixed point $x$. The paper proves that, for measures on $\mathbb{R}^d$ with sufficiently regular densities and for $x$ in the interior of the support, the fiber over $x$ shrinks at the rate $\varepsilon^{1/(d(p-1)+2)}$, with constants controlled by local geometry. This means each conditional support, located by the dual solution, contracts like a Euclidean ball with radius of order $\varepsilon^{1/(d(p-1)+2)}$ as $\varepsilon \to 0$. The rate is shown to be sharp by explicit computations in the case of self-transport on tori and remains valid for all $p \in (1,2]$ and every dimension $d$.
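To get a feel for the exponent 1/(d(p-1)+2) stated in the summary above, the short sketch below tabulates it for a few dimensions and regularization orders. The helper name `sparsity_exponent` is an illustrative choice, not from the paper.

```python
def sparsity_exponent(d, p):
    """Exponent of eps in the support contraction rate eps^(1 / (d(p-1) + 2))."""
    if d < 1 or not (1.0 < p <= 2.0):
        raise ValueError("expected d >= 1 and p in (1, 2]")
    return 1.0 / (d * (p - 1.0) + 2.0)

for d in (1, 2, 10):
    for p in (1.5, 2.0):
        print(f"d={d:2d}  p={p:.1f}  exponent={sparsity_exponent(d, p):.4f}")
```

For quadratic regularization (p = 2) the exponent is 1/(d+2), so in one dimension shrinking ε by a factor of 1000 shrinks the radius scale by a factor of 10. The exponent decreases with the dimension, so the contraction is slower in high dimensions.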
An important technical innovation is the localized analysis: the authors establish these bounds away from the boundary of the support, with explicit dependence on the distance to the boundary. These quantitative estimates extend the one-dimensional rates in [Wiesel & Xu, SIAM J. Math. Anal., 2025] and previous work [González-Sanz & Nutz, SIAM J. Math. Anal., forthcoming] to fully multivariate settings and general marginals.
A second principal contribution is the proof of uniform strong convexity (in the sense of lower bounds on the Hessian) for the regularized dual potential in interior regions. Specifically, on any compact subset of the interior, the Hessian of the potential is bounded below by a positive constant at every point of the subset and for all sufficiently small ε, with the constant depending on the distance to the boundary. This leverages sharp a priori regularity estimates for the ROT potential, which make the restriction to the interior of the domain both necessary and sufficient given the boundary-layer structure of the ROT problem.
The authors further provide optimal convergence rates for the ROT transport maps toward the OT map, which is induced by the unique potential of the classical OT problem, with distances taken over points in the interior. The analysis combines convex geometric arguments with careful duality theory and differentiability properties of the regularized potentials.
Explicit Model and Sharpness
The main theoretical predictions are validated via explicit solutions in the case of self-transport with Lebesgue marginals on the torus $\mathbb{T}^d$, where the problem is exactly solvable. In this setting, the dual optimizer is constant, so the structure of the support is readily analyzed. The asymptotics for the support diameter exactly match $\varepsilon^{1/(d(p-1)+2)}$, demonstrating that the sharpness of the derived rates extends beyond abstract upper and lower bounds to concrete examples.
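The scaling in this model can be checked with a heuristic sketch; this is not the paper's computation. Assume the torus is normalized to unit volume and that, with a constant dual optimizer, the first-order optimality condition $\tfrac12|u|^2 + \varepsilon h_p'(q) = \text{const}$ on the support gives the density $q(u) = \big[(p-1)(r^2-|u|^2)/(2\varepsilon p)\big]_+^{1/(p-1)}$ in the variable $u = x - y$. Requiring $q$ to integrate to 1 then determines the radius $r$ in closed form. The function names are illustrative, and the formula is used only while $r < 1/2$, so that the torus distance coincides with the Euclidean one.

```python
import math

def torus_fiber_radius(eps, d, p):
    """Radius r of the fiber in the assumed torus self-transport model."""
    a = 1.0 / (p - 1.0)
    # Integral of (1 - |v|^2)^a over the unit ball of R^d (Beta-function identity).
    ball_integral = (math.pi ** (d / 2.0) * math.gamma(a + 1.0)
                     / math.gamma(a + d / 2.0 + 1.0))
    # Normalization: ((p-1) / (2 eps p))^a * r^(d + 2a) * ball_integral = 1.
    scale = (2.0 * eps * p / (p - 1.0)) ** a
    r = (scale / ball_integral) ** (1.0 / (d + 2.0 * a))
    if r >= 0.5:
        raise ValueError("eps too large: the ball would wrap around the unit torus")
    return r

def loglog_slope(d, p, eps_hi=1e-4, eps_lo=1e-8):
    """Slope of log r against log eps between two regularization levels."""
    r_hi = torus_fiber_radius(eps_hi, d, p)
    r_lo = torus_fiber_radius(eps_lo, d, p)
    return (math.log(r_hi) - math.log(r_lo)) / (math.log(eps_hi) - math.log(eps_lo))

for d, p in [(1, 2.0), (2, 1.5), (3, 2.0)]:
    predicted = 1.0 / (d * (p - 1.0) + 2.0)
    print(d, p, loglog_slope(d, p), predicted)
```

Under these assumptions the radius is an exact power law in ε, so the measured slope agrees with 1/(d(p-1)+2). For d = 1 and p = 2 the normalization reduces to r³ = 3ε.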
Comparison with Prior Work
The results strictly strengthen prior local and global support contraction bounds for quadratic ROT ($p = 2$) to arbitrary $L^p$-ROT and general dimensions, providing effective control in multivariate, non-symmetric settings beyond self-transport. This unifies, sharpens, and extends earlier foundational studies on ROT sample complexity [González-Sanz, del Barrio & Nutz, 12 Nov 2025] and sparsity structure [Wiesel & Xu, SIAM J. Math. Anal., 2025; Zhang et al., 2023].
A key distinction from the entropic regime ($p \to 1$) is emphasized: in contrast to EOT, where plans are never sparse and have full support for every ε, ROT produces couplings whose support collapses to that of the OT solution at a quantifiable rate as the regularization vanishes.
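The contrast can be illustrated on a discretized circle (the torus with d = 1 and unit length) for self-transport with uniform marginals. This is a sketch under assumptions. The entropic density is taken proportional to the Gibbs kernel exp(-c/ε), which is the standard form when the potentials are constant. The quadratic (p = 2) density is taken as $((r^2 - \text{dist}^2)/(4\varepsilon))_+$ with $r = (3\varepsilon)^{1/3}$, which follows from the first-order optimality condition with a constant potential. Grid size and ε are arbitrary illustrative choices.

```python
import numpy as np

n = 200
eps = 1e-3
grid = np.arange(n) / n

# Geodesic distance on the circle of unit length, and the quadratic cost.
diff = np.abs(grid[:, None] - grid[None, :])
dist = np.minimum(diff, 1.0 - diff)
cost = 0.5 * dist ** 2

# Entropic density with respect to the uniform product measure:
# Gibbs kernel, normalized so each row averages to 1.
eot = np.exp(-cost / eps)
eot /= eot.mean(axis=1, keepdims=True)

# Quadratically regularized density (assumed closed form, constant potential).
r = (3.0 * eps) ** (1.0 / 3.0)
rot = np.maximum(r ** 2 - dist ** 2, 0.0) / (4.0 * eps)

eot_support = np.mean(eot > 0)   # fraction of grid pairs carrying mass
rot_support = np.mean(rot > 0)
print(f"EOT support fraction: {eot_support:.3f}")
print(f"ROT support fraction: {rot_support:.3f}")
print(f"ROT row averages in [{rot.mean(axis=1).min():.4f}, {rot.mean(axis=1).max():.4f}]")
```

The entropic density is strictly positive on every pair of grid points, whereas the quadratic one vanishes outside a band of half-width r around the diagonal, and the band narrows as ε decreases.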
Implications and Future Directions
The established rates have direct implications for:
- Algorithmic design: The sharp local contraction rate can be used to localize computations, resulting in scalable algorithms for high-dimensional regularized OT, exploiting support sparsity for computational efficiency.
- Statistical analysis: Explicit rates yield near-optimal bounds for the sample complexity of ROT estimators, crucial in empirical applications such as domain adaptation, sample-based estimation, and generative modeling.
- Theory of regularized variational problems: The methods extend to other divergence-regularized OT models, potentially yielding parallel results for more general cost functions and regularizers.
The uniform convexity results may inform new approaches in understanding regularization-induced smoothing in variational inference and PDE-based transport models, possibly connecting to the qualitative analysis of the porous medium equation as referenced in related work.
Potential extensions include:
- Quantitative boundary layer estimates bridging interior and global results.
- Broadening the analysis to non-Euclidean geometries and singular measures.
- Application to structured machine learning tasks where sparsity of the induced coupling is desirable.
Conclusion
This work delivers a mathematically rigorous characterization of the local geometric structure of regularized optimal transport with $L^p$-type entropies, providing precise, dimension- and regularization-dependent rates for support contraction, uniform strong convexity of dual potentials, and convergence to classical OT solutions. These results establish definitive benchmarks for both theoretical understanding and practical deployment of ROT in high-dimensional transport and statistical learning settings (2604.00843).