Papers
Topics
Authors
Recent
Search
2000 character limit reached

Variance-Exploding SDEs: Theory & Applications

Updated 4 January 2026
  • Variance-Exploding SDEs are defined as drift-free stochastic processes with a time-dependent diffusion coefficient, forming the basis of generative diffusion models.
  • The ER-SDE framework employs a parameterized noise schedule via φ(σ), allowing interpolation between deterministic ODE solvers and fully stochastic SDE solvers.
  • Efficient VE ER-SDE solvers strike a balance between rapid sampling and high sample diversity, demonstrated by state-of-the-art empirical performance on benchmarks.

Variance-Exploding Stochastic Differential Equations (VE-SDEs) define a class of stochastic processes characterized by increasing variance over time and form a foundational component in generative diffusion models. In VE-SDEs, the forward stochastic differential equation is drift-free with a time-dependent diffusion coefficient, and the backward or reverse-time equation involves a score-based drift. Under the Extended Reverse-Time SDE (ER-SDE) framework, VE-SDEs admit semi-linear solutions with a parameterized noise-schedule, allowing interpolation between fully stochastic and deterministic (ODE) solvers. Approximate closed-form solutions, efficient solvers, and error analyses within this framework provide mathematical and practical insights into the speed, quality, and diversity of @@@@1@@@@ algorithms (Cui et al., 2023).

1. Formal Definition and Structure of VE-SDEs

The Variance-Exploding SDE is defined by a forward equation of the form: dxt=0xtdt+dσt2dtdwt,x0p0(x0)\mathrm{d}x_t = 0\cdot x_t\,\mathrm{d}t + \sqrt{\frac{\mathrm{d}\,\sigma_t^2}{\mathrm{d}t}}\,\mathrm{d}w_t, \qquad x_0 \sim p_0(x_0) where the drift f(t,x)=0f(t, x) = 0, and the diffusion coefficient is g(t)=dσt2dtg(t) = \sqrt{\tfrac{\mathrm{d}\,\sigma_t^2}{\mathrm{d}t}}. The process is initialized from a data distribution p0(x0)p_0(x_0), and wtw_t is the standard Wiener process.

For generative modeling tasks, the reverse-time SDE (specialized from Song et al. 2021 to the VE case) is given by: dxt=[g(t)2xlogpt(xt)]dt+g(t)dwˉt\mathrm{d}x_t = \left[-g(t)^2 \nabla_x \log p_t(x_t)\right] \mathrm{d}t + g(t) \,\mathrm{d}\bar{w}_t with reverse drift frev(t,x)=dσt2dtxlogpt(x)f_{\text{rev}}(t, x) = -\frac{\mathrm{d}\,\sigma_t^2}{\mathrm{d}t} \nabla_x \log p_t(x) and wˉt\bar{w}_t an independent Wiener process.

2. Solution Structure in the Extended Reverse-Time SDE Framework

The ER-SDE framework generalizes both SDE and ODE-based solvers by introducing an independent “reverse” noise scale h(t)h(t) and substituting the learned data-prediction network xθ(xt,t)x_\theta(x_t, t) for the score term.

Switching to the noise-level parameter σ=σt\sigma = \sigma_t, the ER-SDE reads: dxσ=[1σ+ξ(σ)2σ2](xσxθ(xσ,σ))dσ+ξ(σ)dwσ\mathrm{d}x_\sigma = \left[\frac{1}{\sigma} + \frac{\xi(\sigma)}{2\sigma^2}\right](x_\sigma - x_\theta(x_\sigma, \sigma))\,\mathrm{d}\sigma + \sqrt{\xi(\sigma)}\,\mathrm{d}w_\sigma with ξ(σ)0\xi(\sigma) \ge 0, such that h2(t)=ξ(σt)dσtdth^2(t) = \xi(\sigma_t)\,\frac{\mathrm{d}\sigma_t}{\mathrm{d}t}. Define

ϕ(σ)=exp([1σ+ξ(σ)2σ2]dσ)\phi(\sigma) = \exp\left(\int\left[\frac{1}{\sigma} + \frac{\xi(\sigma)}{2\sigma^2}\right] \,\mathrm{d}\sigma\right)

with ϕ(σ)/ϕ(σ)=1σ+ξ(σ)2σ2\phi'(\sigma)/\phi(\sigma) = \frac{1}{\sigma} + \frac{\xi(\sigma)}{2\sigma^2}.

The exact solution for evolving xx from σs\sigma_s to σt\sigma_t (Proposition 1) is: xt=ϕ(σt)ϕ(σs)xs+ϕ(σt) ⁣σtσs ⁣ϕ(σ)ϕ(σ)2xθ(xσ,σ)dσ+σt2σs2[ϕ(σt)ϕ(σs)]2zx_t = \frac{\phi(\sigma_t)}{\phi(\sigma_s)}\,x_s + \phi(\sigma_t)\!\int_{\sigma_t}^{\sigma_s}\! \frac{\phi'(\sigma)}{\phi(\sigma)^2}\,x_\theta(x_\sigma, \sigma) \,\mathrm{d}\sigma + \sqrt{\sigma_t^2 - \sigma_s^2\left[\frac{\phi(\sigma_t)}{\phi(\sigma_s)}\right]^2}z where zN(0,I)z \sim \mathcal{N}(0, I). In practice, the nonlinear integral is approximated using Taylor expansion.

3. Efficient VE ER-SDE Solvers: Algorithmic Construction

The first-order VE ER-SDE-Solver has the following update for each step: x~t=rx~s+(1r)xθ(x~s,σs)+σt2r2σs2z\tilde x_t = r\,\tilde x_s + (1 - r)x_\theta(\tilde x_s, \sigma_s) + \sqrt{\sigma_t^2 - r^2 \sigma_s^2}z where r=ϕ(σt)/ϕ(σs)r = \phi(\sigma_t)/\phi(\sigma_s), and zN(0,I)z \sim \mathcal{N}(0, I). Each update utilizes a single evaluation of xθx_\theta.

Pseudocode for the algorithmic workflow (as described in Algorithm 1) is:

Step Description Details
1 Initialization x0xTx_0 \gets x_T (initial noise sample)
2 For i=1,...,Mi = 1, ..., M rϕ(σi)/ϕ(σi1)r \gets \phi(\sigma_i)/\phi(\sigma_{i-1})
zN(0,I)z \sim \mathcal{N}(0, I)
x^xi1\hat{x} \gets x_{i-1}
drift xθ(x^,σi1)\gets x_\theta(\hat{x}, \sigma_{i-1})
noise σi2r2σi12z\gets \sqrt{\sigma_i^2 - r^2 \sigma_{i-1}^2} \cdot z
xirx^+(1r)x_i \gets r\cdot\hat{x} + (1 - r)\cdot drift ++ noise
3 Output xMx_M

The method is tunable via the choice of ϕ(σ)\phi(\sigma), the number of steps MM, and the schedule tiσit_i \mapsto \sigma_i. Higher-order versions reuse evaluations and add finite-difference corrections at the cost of additional network calls per step.

4. Local Discretization Error and Theoretical Analysis

Discretization error is governed by the First-Order Euler Integral (FEI) coefficient: FEI=1ϕ(σt)ϕ(σs)\mathrm{FEI} = 1 - \frac{\phi(\sigma_t)}{\phi(\sigma_s)} which quantifies the dominant local one-step error: x~txt=O((σtσs)2)\tilde x_t - x_t = \mathcal{O}((\sigma_t - \sigma_s)^2) The minimum FEI is achieved by choosing ϕ(σ)=σ\phi(\sigma) = \sigma, corresponding to the deterministic probability-flow ODE, resulting in the lowest discretization error among the ER-SDE family. Any larger ϕ()\phi(\cdot) increases FEI and the corresponding global error for equal step-size.

A change of variable demonstrates that VE and VP ER-SDEs share the same FEI coefficient, establishing parity between these formulations for a given pretrained model and fixed number of function evaluations (NFE).

5. Stochasticity, Sample Quality, and Diversity

ODE-based samplers with ϕ(σ)=σ\phi(\sigma) = \sigma have minimal local error but no injected noise, leading to less sample diversity. Choosing ϕ()\phi(\cdot) close to σ\sigma but sufficiently large to inject controlled noise allows ER-SDE-based VE solvers to achieve near-ODE fidelity without sacrificing diversity. This stochasticity-efficiency tradeoff is central: VE-SDE solvers interpolate between pure SDE and ODE processes, balancing rapid low-NFE sampling with high sample quality and output variability.

6. Practical Considerations and Empirical Findings

VE ER-SDE-Solvers are parameterized by the noise-scale function ϕ(σ)\phi(\sigma), number of steps MM, and schedule mapping tiσit_i \mapsto \sigma_i. First-order solvers require just one network evaluation per step. Advanced higher-order variants increase per-step cost for potentially improved empirical accuracy.

Empirical evaluation on the ImageNet 128×128128 \times 128 benchmark demonstrates that ER-SDE-Solvers attain state-of-the-art performance across stochastic samplers while maintaining the efficiency typical of deterministic samplers (e.g., $8.33$ FID in $20$ function evaluations) (Cui et al., 2023). This suggests that appropriate tuning of ϕ()\phi(\cdot) enables simultaneous optimization of sample quality and computational efficiency.

7. Significance and Theoretical Summary

The ER-SDE framework unifies ODE and SDE sampling methodologies for VE-SDEs, providing a family of semi-linear solutions whose error and stochasticity are parametrically controlled by ϕ(σ)\phi(\sigma). The key theorem asserts that, among all extended reverse-time SDEs with a given drift-score model, the ODE (h(t)=0,ϕ(x)=xh(t) = 0, \phi(x) = x) uniquely minimizes the local discretization error. A plausible implication is that careful functional choice allows constructing samplers that closely approach ODE performance while preserving the stochastic effects essential for output variability and model robustness (Cui et al., 2023).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Variance-Exploding Stochastic Differential Equations (VE-SDEs).