Koopman Spectral Wasserstein Gradient Descent
- KSWGD is a training-free, particle-based generative modeling method that leverages Koopman spectral techniques and optimal transport to drive particles toward unknown target distributions.
- It approximates the inverse Langevin generator via a data-driven, finite-rank spectral surrogate, ensuring constant dissipation rates and accelerated convergence even in high dimensions.
- Empirical results show that KSWGD achieves full support coverage and linear KL decay across benchmarks like S¹ uniform sampling, quadruple well, and MNIST latent generation.
Koopman Spectral Wasserstein Gradient Descent (KSWGD) is a training-free, particle-based generative modeling methodology uniting operator-theoretic spectral analysis with variational optimal transport theory. At its core, KSWGD leverages trajectory or time-series data to approximate the infinitesimal generator of overdamped Langevin dynamics through Koopman spectral techniques, and subsequently drives particles deterministically along a preconditioned Wasserstein gradient flow toward an unknown target distribution, achieving accelerated convergence without explicit knowledge of the target potential or reliance on neural network training (Xu et al., 21 Dec 2025).
1. Definition and Theoretical Framework
KSWGD targets generative sampling problems where only samples (possibly arranged temporally) from an unknown distribution are available. The goal is to transform an initial empirical measure to approximate by discretizing the –Wasserstein gradient flow:
Here, corresponds to the inverse Langevin generator . The key innovation of KSWGD is to approximate in a fully data-driven and finite-rank manner using Koopman spectral methods (such as Extended Dynamic Mode Decomposition, EDMD), resulting in a preconditioned flow with constant dissipation rate even in high dimensions. This approach operationalizes the same mathematical foundation as Laplacian-Adjusted Wasserstein Gradient Descent (LAWGD) but circumvents the need for target potential access or score network training.
The algorithm proceeds as follows:
- Estimate the leading eigenpairs of the generator from time-ordered trajectory pairs.
- Build a truncated spectral surrogate for the inverse Langevin operator.
- Update particles along this Koopman-preconditioned Wasserstein gradient flow.
2. Mathematical Formulation and Algorithm
Wasserstein Gradient Flow and Spectral Preconditioning
The evolution of under the –Wasserstein gradient flow in is characterized by the velocity field
with preconditioning (as in LAWGD) by leading to:
where are eigenpairs of the Langevin generator (self-adjoint on ).
Koopman Spectral Approximation
Recognizing that the Koopman (backward Kolmogorov) generator , KSWGD uses data-driven spectral approximation from trajectory pairs by EDMD to identify leading empirical eigenpairs of , yielding the truncated inverse:
Discrete Particle Update
Given particles , a single KSWGD step of size is:
where
and denotes gradient with respect to the first argument.
Pseudocode
The methodology splits naturally into Offline and Online phases:
| Step | Description | Computational Cost |
|---|---|---|
| Offline (Spectral Est.) | Dictionary selection, data matrix formation, EDMD eigenproblem, obtain leading | (for dictionary size ) |
| Online (Updates) | Iteratively update particles using Koopman-preconditioned flow | per iteration |
Auxiliary notes:
- Computing eigenfunction gradients may require analytic expressions or finite-difference approximations.
- The bias-variance trade-off is controlled by rank (truncation) and step size .
3. Theoretical Properties
Spectral Preconditioning and Dissipation
The preconditioned flow with truncated spectral surrogate satisfies:
with dissipation identity:
where is projection onto the retained -dimensional eigenspace.
Under mild regularity and a tail bound , the ideal convergence rate is:
Data-Driven Error Bounds
If the Koopman spectral approximation error meets:
then
Discrete-time convergence in the Approximate Gradient Flow (AGF) setting yields, for step size :
implying geometric decay to bias modulated by .
Feynman–Kac Perspective
With in the Feynman–Kac formula,
the Koopman semigroup encodes unconditional observable expectations, underlining the probabilistic foundation of KSWGD sampling. Extension to would encompass conditional or rare-event inference.
4. Experimental Validation and Benchmarking
KSWGD's empirical performance was examined across diverse generative modeling milieus:
| Task/Dataset | Particles | Koopman Method | Key Metric(s) |
|---|---|---|---|
| S uniform sampling | 700 | kernel-EDMD (RBF/poly) | KL decay, movement rate, coverage |
| Quadruple well | 500 | SDMD (neural dict.) | KL, well coverage, movement rate |
| MNIST (latent) | 64 | CNN+EDMD dict. learning | Visual sample, KL, divergence |
| Allen–Cahn SPDE | 150 | EDMD (poly features) | Visual fidelity, distributional prediction |
Baselines examined include DMPS (diffusion maps), DDPM, VAE, RealNVP, WGAN-GP. Notable empirical results:
- On S–uniform and quadruple well, KSWGD achieved full support coverage in iterations, while DMPS required .
- Empirical KL decay was linear, validating theoretical predictions.
- On MNIST latent code, KSWGD produced discernible digit samples; DMPS failed under parallel conditions.
- On Allen–Cahn SPDE, KSWGD matched or surpassed DDPM, VAE, normalizing flows, and GANs in latent space sample quality.
5. Computational and Practical Considerations
Key computational and modeling constraints include:
- Offline spectral estimation incurs eigendecomposition, while each online step scales as .
- Gradients of eigenfunctions may demand basis-specific analytic or finite-difference evaluation.
- Bias-variance trade-offs hinge on truncation rank (reducing at increased computational cost) and step size (smaller reduces discretization bias but necessitates longer runs).
- Quality of latent autoencoders or dictionary selection directly bounds generative accuracy in high-dimensional settings.
- Assumptions of ergodicity and self-adjointness of the generator are essential; extensions to non-reversible dynamics necessitate additional research.
6. Connections, Scope, and Limitations
KSWGD synthesizes the spectral guarantees of LAWGD with the data-driven practicality of EDMD, providing a theoretically justified and computationally accessible recipe for generative particle-based sampling. It eliminates the need for explicit potential evaluation or neural-network-based score learning. Current limitations encompass the necessity for high-quality spectral/dictionary approximations, computational scaling with rank and particle count, and the restriction to detailed-balance (oscillating) dynamical systems. Future work may address extensions to irreversible (non-detailed-balance) generators and autonomous dictionary adaptation (Xu et al., 21 Dec 2025).