Papers
Topics
Authors
Recent
Search
2000 character limit reached

Multi-step EDMD for Nonlinear Systems

Updated 24 January 2026
  • Multi-step EDMD is a data-driven framework that leverages Koopman operator theory to approximate nonlinear dynamics via linear representations in a lifted function space.
  • It minimizes long-horizon errors by directly learning multi-step state outputs, avoiding error compounding seen in recursive single-step methods.
  • The framework employs advanced dictionary learning and sparsity techniques, yielding robust surrogates applicable to density forecasting, system identification, and model predictive control.

The multi-step Extended Dynamic Mode Decomposition (EDMD) framework is a family of data-driven numerical methods for forecasting, modeling, and controlling nonlinear dynamical systems via linear approximations in function space. By leveraging action of the Koopman operator on a finite set of observables, multi-step EDMD enables high-fidelity multi-horizon prediction and robust surrogate model construction for stochastic, deterministic, and controlled systems. Key advantages over classical single-step EDMD include direct minimization of long-horizon errors, avoidance of error compounding under recursive prediction, and improved tractability for real-time and control applications.

1. Mathematical Foundations and Koopman Lifting

The multi-step EDMD framework generalizes the spectral approach of Koopman operator theory to nonlinear systems, including both deterministic maps and stochastic differential equations (SDEs) with diffusion. Consider a continuous-time SDE

dXt=b(Xt) dt+σ(Xt) dWtdX_t = b(X_t)\,dt + \sigma(X_t)\,dW_t

with initial distribution X0∼p0(x)X_0 \sim p_0(x), drift b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d, and noise σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}, or a discrete-time system xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t). The associated stochastic Koopman semigroup {Kt}\{K^t\} acts on observables ψ\psi by Ktψ(x)=E[ψ(Xtx)]K^t\psi(x) = \mathbb{E}[\psi(X_t^x)], defining a linear semigroup generated by the infinitesimal generator

Lv(x)=b(x)⋅∇v(x)+12Σ(x):∇2v(x),Σ(x)=σ(x)σ(x)⊤,\mathcal{L}v(x) = b(x)\cdot\nabla v(x) + \frac{1}{2}\Sigma(x):\nabla^2 v(x),\quad \Sigma(x)=\sigma(x)\sigma(x)^\top,

with Fokker–Planck adjoint L∗p=−∇⋅[b p]+12∇⋅∇⋅[Σp]\mathcal{L}^*p = -\nabla\cdot[b\,p] + \frac{1}{2}\nabla\cdot\nabla\cdot[\Sigma p] (Zhao et al., 2022). Under ergodicity, there exists a unique invariant density X0∼p0(x)X_0 \sim p_0(x)0, and the X0∼p0(x)X_0 \sim p_0(x)1 Hilbert space inner product governs the spectral structure of X0∼p0(x)X_0 \sim p_0(x)2 and X0∼p0(x)X_0 \sim p_0(x)3.

Deterministic systems use analogous constructions, with observables X0∼p0(x)X_0 \sim p_0(x)4 spanning a finite-dimensional subspace, and evolution in this lifted space governed by approximations of the infinite-dimensional Koopman operator (Schurig et al., 10 Nov 2025, Meda et al., 4 Apr 2025).

2. Finite-Dimensional EDMD Approximations and Multi-Step Forecasting

The core idea of EDMD is to approximate X0∼p0(x)X_0 \sim p_0(x)5 by a finite-dimensional matrix acting on the lifted state, using empirical data:

  • Snapshot pairs X0∼p0(x)X_0 \sim p_0(x)6 are generated via dynamics and possible control inputs.
  • The observables define the lift X0∼p0(x)X_0 \sim p_0(x)7; the data matrices are X0∼p0(x)X_0 \sim p_0(x)8.
  • Gram matrices are X0∼p0(x)X_0 \sim p_0(x)9, b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d0.
  • The Koopman approximation is b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d1, with eigenpairs b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d2 giving approximate eigenvalues and eigenfunctions (Zhao et al., 2022, Meda et al., 4 Apr 2025).

Multi-step prediction proceeds by recursive iteration:

b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d3

where the lifted state b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d4, b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d5 selects physical states from the lifted coordinates, and b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d6 recovers outputs or measurements (Meda et al., 4 Apr 2025). In stochastic density forecasting, spectral decompositions using b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d7 generate multi-step probability evolution via b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d8 or eigenmode propagation (Zhao et al., 2022).

3. Dictionary Design, Manifold Optimization, and Sparsity

The expressiveness and generalization of EDMD surrogates depend critically on the choice of observables. Standard bases include monomials, radial basis functions (RBFs), and neural networks (Meda et al., 4 Apr 2025). Dictionary learning is further refined by geometric optimization on the Grassmann manifold: selecting a subspace b:Rd→Rdb:\mathbb{R}^d\to\mathbb{R}^d9 that minimizes multi-step forecast error via Riemannian optimization yields approximately invariant, low-dimensional subspaces that are robust to out-of-domain states (Schurig et al., 10 Nov 2025). The error metric integrates projection error over finite horizons and test initial conditions, and the optimization descends to Grassmannian geometry with Riemannian gradients and QR-based retraction.

Explicit dictionary pruning and structure discovery are enabled by including σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}0-type regularization in least-squares identification, which removes irrelevant observables and yields parsimonious, efficient surrogates (Wu et al., 17 Jan 2026). Parallel decomposition across states, steps, and dictionary rows supports scalable computation.

4. Multi-Step Least-Squares Identification and Error Control

A key advance of the multi-step EDMD framework is direct learning of the condensed multi-step state-output map. Rather than identifying an operator σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}1 for single-step propagation and composing σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}2 (which leads to error compounding and potential instability for σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}3), the framework fits for each prediction step σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}4 a linear map from dictionary lifts and past inputs to the σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}5-step-ahead state:

σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}6

The identification is performed via convex least-squares:

σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}7

with row-wise σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}8 sparsity for dictionary pruning (Wu et al., 17 Jan 2026). This produces stable, direct multi-step surrogates whose error does not grow with prediction horizon σ:Rd→Rd×s\sigma:\mathbb{R}^d\to\mathbb{R}^{d\times s}9. Theorem 3 in (Wu et al., 17 Jan 2026) provides, under Sobolev regularity and bounded-dictionary assumptions, high-probability error bounds independent of xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)0:

xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)1

where xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)2 is the dictionary size, xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)3 the number of data, and xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)4 the degree for polynomial lifts. In contrast, one-step EDMD bounds exhibit exponential-in-xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)5 error blow-up when xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)6.

5. Applications, Numerical Performance, and Practical Considerations

Multi-step EDMD has been demonstrated for prediction, nonlinear system identification, density forecasting, and model predictive control (MPC) across a wide range of systems:

  • Density forecast for SDEs: Approximates the semigroup generated by the Fokker–Planck operator, achieving weak convergence of the density representation as data and dictionary sizes increase (Zhao et al., 2022).
  • Nonlinear system surrogates: For electric vehicle (EV) cabin climate, multi-step EDMD with RBF dictionaries achieves state RMSE xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)7 and power RMSE xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)8 for xt+1=f(xt,ut)x_{t+1}=f(x_t, u_t)9 basis, outperforming polynomial and neural-network dictionaries at small {Kt}\{K^t\}0 (Meda et al., 4 Apr 2025).
  • Koopman MPC: Direct multi-step identification solves the full condensed map for state and control over control horizon {Kt}\{K^t\}1, enabling stable closed-loop control where one-step EDMD leads to diverging errors and controller instability. In benchmarks, multi-step EDMD achieves stable MSE under open and closed loop for {Kt}\{K^t\}2, while one-step MSE diverges for {Kt}\{K^t\}3 due to spectral radius {Kt}\{K^t\}4 (Wu et al., 17 Jan 2026).
  • Dictionary learning: Grassmannian-based shaping yields low-dimensional surrogates with out-of-domain generalization and large speedups in predictor evaluation (Schurig et al., 10 Nov 2025).

Per-step computational costs are governed by {Kt}\{K^t\}5 matrix-vector products and {Kt}\{K^t\}6 dictionary evaluation (with {Kt}\{K^t\}7 cost per basis). For {Kt}\{K^t\}8–{Kt}\{K^t\}9, real-time execution on embedded platforms is practical (Meda et al., 4 Apr 2025).

6. Algorithmic and Implementation Summary

A generic multi-step EDMD workflow proceeds as:

  1. Dictionary selection: Choose functions ψ\psi0—polynomial, RBF, or learned—appropriate to system structure and data availability.
  2. Data acquisition: Collect snapshot (state, next-state) or (trajectory) data under representative state and control distributions.
  3. Lifting and matrix assembly: Formulate ψ\psi1, ψ\psi2 and associated Gram matrices; assemble multi-step input-output structures for supervised learning.
  4. Identification: Solve (regularized) least-squares for operator matrices, possibly under parallelizable decomposition across step ψ\psi3 and state ψ\psi4; perform dictionary pruning as necessary.
  5. Prediction: For given initial condition and input sequence, propagate via

ψ\psi5

up to desired horizon, reconstruct state and output via ψ\psi6 and ψ\psi7 matrices.

  1. Validation: Assess with trajectory RMSE, Consistency Index (CI), or finite-horizon forecast errors; compare performance with alternative dictionaries or identification schemes (Zhao et al., 2022, Meda et al., 4 Apr 2025, Schurig et al., 10 Nov 2025, Wu et al., 17 Jan 2026).

When relevant (e.g., Grassmannian dictionary learning), implement Riemannian optimization to shape active observables for improved multi-step invariance and forecast accuracy (Schurig et al., 10 Nov 2025). Closed-loop extensions with linear MPC/LQR or online adaptation using recursive least squares are direct, leveraging the lifted linear structure of the surrogate.

7. Convergence Guarantees and Limitations

Convergence properties are established in terms of increasing data (ψ\psi8), basis set expressiveness (ψ\psi9), and regularity of the lifting functions. For stochastic systems, EDMD approximations converge to the Galerkin projection of the Fokker–Planck semigroup in Ktψ(x)=E[ψ(Xtx)]K^t\psi(x) = \mathbb{E}[\psi(X_t^x)]0, and the truncated density forecast Ktψ(x)=E[ψ(Xtx)]K^t\psi(x) = \mathbb{E}[\psi(X_t^x)]1 converges weakly to the true solution (Zhao et al., 2022). In deterministic and control settings, multi-step EDMD avoids error blow-up inherent to one-step recursion, with finite-time sample error scaling optimally in dictionary size and sample count (Wu et al., 17 Jan 2026). Overfitting and lack of generalization can arise with excessive dictionary dimension, motivating use of Grassmannian shaping and sparsity regularization (Schurig et al., 10 Nov 2025, Wu et al., 17 Jan 2026).

A plausible implication is that careful system-specific dictionary design, appropriate regularization, and direct multi-horizon identification are essential for the robust practical application of EDMD in high-dimensional settings and real-time control scenarios.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Multi-step EDMD Framework.