Optimal Transport-Governed Controlled Annealing

Updated 20 January 2026

Optimal Transport-Governed Controlled Annealing is a framework that redefines annealing schedules as controlled geodesic flows in the space of probability measures.
The methodology employs entropy and KL-divergence regularization with adaptive step sizes to balance approximation and relaxation errors.
Algorithmic variants like annealed Sinkhorn and trust-region paths enable efficient sampling and convergence in high-dimensional, multimodal problems.

Optimal Transport-Governed Controlled Annealing is the mathematical and algorithmic paradigm in which annealing schedules, designed to interpolate between easy-to-sample and complex target distributions or measures, are systematically constructed and governed by principles from optimal transport (OT) theory. This framework underpins a diverse range of methodologies in computational mathematics, stochastic optimization, and machine learning, encompassing algorithms that adaptively modulate entropy regularization, steer Markovian or flow-based dynamics, or enforce constraint satisfaction by transporting probability measures along geodesic or near-geodesic paths subject to physical, statistical, or computational criteria.

1. Mathematical Foundations: Controlled Annealing as Optimal Transport

At the core of optimal transport-governed annealing lies the reformulation of annealing trajectories as geodesic or controlled curves in the space of probability measures, typically represented either by density evolution under continuity equations or by entropy-regularized functionals. The central objects are distributions $\{\mu_t\}_{t\in[0,T]}$ (e.g., Gibbs or other exponential families parameterized by an inverse-temperature $\beta(t)$ ), along with velocity fields $v_t$ that solve the continuity equation: $\partial_t \mu_t + \nabla \cdot (\mu_t v_t) = 0,$ where $v_t$ minimizes the average kinetic energy $\int |v_t|^2 d\mu_t$ (Benamou–Brenier dynamic OT). This characterization ensures minimal-norm evolution along Wasserstein geodesics and admits an explicit gradient-potential formulation: $v_t = \nabla \varphi_t,\quad -\nabla\cdot(\mu_t \nabla\varphi_t) = \partial_t \mu_t.$ Key results establish that such controlled flows arise naturally in simulated annealing and in the dynamics of overdamped Langevin and piecewise-deterministic Markov processes, ensuring that the evolving law matches $\mu_t$ exactly when controlled with $v_t$ (Molin et al., 11 Apr 2025, Aurell et al., 2010).

2. Annealing Schedules, Regularization, and Error Trade-offs

Annealing protocols based on OT principles often involve entropy or KL-divergence regularization schemes, interpolating between a tractable initial measure and the desired target. For instance, the entropy-regularized OT problem (EOT) introduces an inverse temperature $\beta$ : $EOT_\beta(p,q) = \min_{\pi \in \Gamma(p,q)}\langle c, \pi \rangle + \beta^{-1} KL(\pi \| \pi^{ref}),$ where $\beta$ controls the strength of the entropic term. In contrast to using a fixed $\beta$ , OT-governed annealing operates with a non-decreasing schedule $\{\beta_t\}$ to gradually reduce regularization and approach the exact OT solution.

Theoretical analysis reveals two primary sources of error in such iterative procedures: the entropic (approximation) error $O(\beta_t^{-1})$ and the relaxation error $O(\beta_t - \beta_{t-1})$ . Optimal convergence trade-offs are attained when $\beta_t \sim t^{1/2}$ , yielding $O(t^{-1/2})$ rates; further, "debiased" algorithms reduce relaxation bias and achieve faster rates ( $O(t^{-2/3})$ for an optimized schedule) by reweighting marginals via dual OT potentials (Chizat, 2024).

3. Algorithmic Realizations and OT-Governed Annealing Variants

Multiple algorithmic incarnations of OT-governed annealing have been established in the literature:

Annealed Sinkhorn and Debiased Variants: Standard Sinkhorn's algorithm is extended by annealing $\beta_t$ ; debiasing via adaptively reweighted marginals removes leading-order relaxation error and enables faster annealing schedules, all with $O(mn)$ per-iteration complexity (Chizat, 2024, Delalande, 2021).
Trust-Region Annealing in Path Space: In stochastic optimal control and path-space inference, trust-region KL constraints induce deterministic/geometric annealing schedules $\{P^{u_i}\}$ between prior and target path measures, transporting via

$P^{u_i} \propto P^{1-\beta_i} Q^{\beta_i},$

where the parameter increments $\Delta\beta$ are adaptively chosen to achieve constant Fisher–Rao steps, yielding equispaced geodesic discretizations on statistical manifolds (Blessing et al., 17 Aug 2025).

Particle-Based and Flow-Based Annealing: Controlled Monte Carlo Diffusion, Annealing Flow, and other CNF-based samplers employ OT-driven loss functionals—often with dynamic or Wasserstein-regularized objectives—guiding the ensemble from reference to target measure via physical, score-difference, or minimal-action vector fields (Vargas et al., 2023, Wu et al., 2024).
Constrained Density Estimation via Annealed Penalties: Annealing-like barrier and mollifier parameters enforce expectation constraints and regularization inequalities along OT-minimizing sample paths, iteratively reducing constraint mollification and barrier strengths for convergence to feasible optima (Hu et al., 11 Jan 2026).

4. Links to Thermodynamics, Stochastic Control, and Geometry

The connection between OT-governed annealing and nonequilibrium thermodynamics has been rigorously formalized. Minimizing work or heat during rapid transitions corresponds to driving protocols along OT geodesics (Lagrangian view) or Hamilton–Jacobi–Burgers flows; the optimal control transforms between initial and final equilibrium distributions with minimal dissipation. Friction-tensor (Riemannian) geometry on control space is shown to be equivalent to $L^2$ -Wasserstein OT geometry; in finite-time regimes, optimal controlled paths combine a geodesic (slow manifold) part with explicit counterdiabatic corrections derived from the Fisher information (Zhong et al., 2024, Aurell et al., 2010).

In modern stochastic optimal control, OT schedules govern trust-region constrained updates, enforcing small KL divergences per step to maintain numerical stability and ensure convergence along Fisher–Rao geodesics, distinct from classic Wasserstein geodesics except in Gaussian-linear settings (Blessing et al., 17 Aug 2025).

5. Convergence Analysis, Scalings, and Complexity

Rigorous convergence and complexity results have been obtained for OT-based annealing schemes:

For entropic semi-discrete OT, nearly tight convergence bounds on Sinkhorn potentials enable quantitative design of $\varepsilon$ -scaling schedules, justifying warm starts and providing explicit rates for primal and dual errors as well as suboptimality in the transport cost (Delalande, 2021).
Trust-region annealing with adaptive Fisher–Rao steps allows the interpolation to be equispaced in information geometry, yielding stable and sample-efficient algorithms in high-dimensional and multimodal contexts (Blessing et al., 17 Aug 2025, Wu et al., 2024).
Controlled annealing with particle-based approximations leverages the stability of OT maps under weak convergence, ensuring that empirical dynamics converge to the ideal velocity field as the number of particles and time discretization increase (Molin et al., 11 Apr 2025).

The resulting methods achieve both theoretical guarantees and competitive empirical performance: controlled OT-driven schemes accelerate adaptation, escape local minima more rapidly, and outperform static or naively scheduled annealing in terms of metric accuracy (e.g., $W_2$ distance, ESS, and suboptimality).

6. Empirical Comparisons and Pareto-Optimality

Across representative simulated and practical problems—ranging from double-well and multimodal synthetic landscapes to high-dimensional path-sampling and density estimation in finance—OT-governed annealing methods exhibit clear advantages:

"Debiased Annealed Sinkhorn" with an optimized schedule matches or exceeds the speed-accuracy Pareto front of standard Sinkhorn across all target errors in a single run, outperforming fixes at different $\beta$ in both smooth and unstructured cost settings (Chizat, 2024).
"Annealing Flow" and "Controlled Monte Carlo Diffusions" sample inherently multi-modal and high-dimensional targets with better mode coverage, mixing, and efficiency than purely entropy-based or uncontrolled schemes (needing $\sim10$ steps vs. $\sim100$ ) (Wu et al., 2024, Vargas et al., 2023).
In stochastic optimal control and generative modeling, OT-constrained annealing over path space using trust-region or counterdiabatic corrections achieves lower bias, higher stability, and reduced computational budget compared to fixed-grid or naive annealing (Blessing et al., 17 Aug 2025, Zhong et al., 2024).

7. Limitations, Open Directions, and Extensions

The known limitations of OT-governed controlled annealing include the inability to surpass $O(t^{-1/2})$ rates in traditional entropic annealing unless relaxation bias is explicitly eliminated, as shown in debiasing procedures. Full nonasymptotic proofs for some empirically robust debiased methods are still open problems (Chizat, 2024).

The unification of OT principles with stochastic control, thermodynamic geometry, and SDE-driven particle methods continues to inform the development of accelerated algorithms, adaptive step-size selection, and scalable priors for density and constraint-enforced estimation. Extensions to settings with inequality constraints, high-dimensional priors, and unbalanced or measure-valued targets remain active research areas (Hu et al., 11 Jan 2026).

In summary, optimal transport-governed controlled annealing provides a principled, geometrically-structured, and computationally viable basis for designing and analyzing annealing algorithms in high-dimensional statistical computation, inference, and optimization, underpinned by rigorous error bounds, robust convergence properties, and concrete physical interpretations (Chizat, 2024, Molin et al., 11 Apr 2025, Blessing et al., 17 Aug 2025, Wu et al., 2024, Zhong et al., 2024, Aurell et al., 2010, Hu et al., 11 Jan 2026, Delalande, 2021).