Papers
Topics
Authors
Recent
Search
2000 character limit reached

Time-Varying Persistence Diagrams

Updated 22 December 2025
  • Time-Varying Persistence Diagrams (TVPDs) are sequences of persistence diagrams that capture the evolution of topological features in dynamic datasets by integrating both temporal and spatial information.
  • The Continuous Edit Distance (CED) framework provides a mathematically principled method for aligning, averaging, and clustering TVPDs, ensuring robust metric computation and interpretable geodesics.
  • Dynamic programming and geodesic-based optimization techniques support scalable algorithmic analysis, improving motif discovery and classification in various real-world applications.

Time-varying persistence diagrams (TVPDs) are sequences of persistence diagrams indexed over time, representing the evolution of topological features within time-varying data. TVPDs constitute a foundational structure in topological data analysis for dynamical systems, capturing how homological features such as connected components, loops, and voids emerge, persist, and vanish as data evolve. The robust comparison, alignment, averaging, and clustering of TVPDs require metrics that are simultaneously sensitive to both topological and temporal aspects. The Continuous Edit Distance (CED) framework for TVPDs provides a mathematically principled and computationally tractable metric that generalizes string edit distances and incorporates temporal elasticity, optimal alignment, and interpretable geodesic paths (Tchitchek et al., 15 Dec 2025).

1. Mathematical Definition and Metric Properties

A TVPD is a map PP defined on a collection of time intervals i=1NP[ai,bi]\bigcup_{i=1}^{N_P}[a_i, b_i], typically discretized at a fixed subdivision scale Δ>0\Delta>0, assigning to each tt a persistence diagram P(t)P(t) in the Polish metric space (D,W2)(D, W_2), where W2W_2 is the 2-Wasserstein distance on persistence diagrams. For two TVPDs PP and QQ, and subdivision intervals PiP_i, QjQ_j of length Δ\Delta, the local substitution cost is

dαΔ(Pi,Qj)=t=aibi[(1α)W2(P(t),Q(t+(cjai)))+αcjai]dtd^\Delta_\alpha(P_i, Q_j) = \int_{t=a_i}^{b_i} \left[ (1-\alpha) W_2 \left( P(t), Q\big(t+(c_j-a_i)\big) \right) + \alpha|c_j-a_i| \right] dt

with α(0,1)\alpha \in (0,1) trading off spatial diagram mismatch against temporal misalignment. Deletion and insertion costs are

dαΔ(Pi,A)=aibi(1α)W2(P(t),A)dt,d^\Delta_\alpha(P_i, A) = \int_{a_i}^{b_i} (1-\alpha) W_2\big(P(t), A\big) dt,

where AA is a reference (typically the empty diagram).

The global edit distance, for gap-penalty β>0\beta > 0, is defined by minimizing over all order-preserving partial assignments ff between the subdivisions of PP and QQ: CEDα,βΔ(P,Q)=minf{idomfdαΔ(Pi,Qf(i))+βidomfdαΔ(Pi,A)+βjImfdαΔ(Qj,A)}.CED^\Delta_{\alpha,\beta}(P,Q) = \min_f \Bigg\{ \sum_{i \in \operatorname{dom} f} d^\Delta_\alpha(P_i, Q_{f(i)}) + \beta \sum_{i \notin \operatorname{dom} f} d^\Delta_\alpha(P_i, A) + \beta \sum_{j \notin \operatorname{Im} f} d^\Delta_\alpha(Q_j, A) \Bigg\}. This infimum yields a true metric, satisfying the triangle inequality and nonnegativity for unit-cost choices (Tchitchek et al., 15 Dec 2025).

2. Geodesics and Explicit Constructions

For β=1\beta = 1, there exists an explicit constant-speed geodesic, providing an interpretable optimal path between TVPDs in edit-distance geometry. The geodesic consists of three phases:

  • Deletion: Subintervals of the source TVPD unmatched in the optimal assignment collapse to AA at constant speed.
  • Substitution: Matched subdivisions move continuously (via local dαΔd^\Delta_\alpha-geodesics) toward their targets.
  • Insertion: Unmatched subdivisions in the target TVPD are spawned from AA. This construction enables extraction of midpoint trajectories, facilitates barycenter algorithms, and guarantees minimum-energy paths (Tchitchek et al., 15 Dec 2025).

3. Algorithmic Approaches

The pairwise CED computation is cast as a dynamic programming problem. Letting δ(i,j)\delta(i,j) denote the minimal CED between prefixes (P1,,Pi)(P_1,\dots,P_i), (Q1,,Qj)(Q_1,\dots,Q_j), the Bellman recursion is: δ(i,j)=min{δ(i1,j)+βdαΔ(Pi,A),  δ(i,j1)+βdαΔ(Qj,A),  δ(i1,j1)+dαΔ(Pi,Qj)}\delta(i,j) = \min\Big\{ \delta(i-1, j) + \beta d^\Delta_\alpha(P_i, A),\; \delta(i, j-1) + \beta d^\Delta_\alpha(Q_j, A),\; \delta(i-1, j-1) + d^\Delta_\alpha(P_i, Q_j) \Big\} with initialization δ(i,0)\delta(i, 0) and δ(0,j)\delta(0, j) as cumulative deletion and insertion costs. The table dimension is O(NPNQ)O(N_P N_Q). Precomputation of local costs is dominated by the evaluation of W2W_2 for persistence diagrams at multiple time stamps and typically costs O(mp3)O(m \cdot p^3), where mΔ/ηm \approx \Delta/\eta and pp is the diagram size. The overall pairwise CED computation is O(NPNQmp3)O(N_P N_Q m p^3) (Tchitchek et al., 15 Dec 2025).

4. Barycenters and Fréchet Means in TVPD Spaces

Given a collection {V1,,Vn}\{V_1,\ldots, V_n\} of TVPDs, a barycenter minimizes the CED-based Fréchet functional,

E(X)=i=1n[CEDα,1Δ(X,Vi)]2.E(X) = \sum_{i=1}^n [CED^\Delta_{\alpha, 1}(X, V_i)]^2.

Two optimization strategies, both leveraging explicit CED-geodesics, are employed:

  • Stochastic Geodesic Descent: Initialize at a randomly chosen TVPD; at each iteration, move along a CED-geodesic toward a randomly selected TVPD reference, accepting steps that reduce E(X)E(X).
  • Greedy Geodesic Descent: For each reference TVPD and prescribed step discretization along the geodesic, select the candidate with minimal Fréchet energy.

Both approaches guarantee monotonically non-increasing Fréchet energy and yield local minimizers, affording practical computation of barycenters for alignment and clustering (Tchitchek et al., 15 Dec 2025).

5. Empirical Performance and Robustness

Empirical evaluation demonstrates the CED framework’s robustness to both temporal and spatial perturbations. Perturbing sample times (“temporal jitter”) or the feature values within persistence diagrams leads to near-linear or piecewise-linear CED growth at small noise levels, quantifying stability to real-world noise. CED alignment correctly recovers temporal offsets (e.g., phase shifts in dynamical regime transitions), outperforms standard elastic dissimilarities (such as Dynamic Time Warping and TWED) for motif search, and supports accurate motif window retrievals (Tchitchek et al., 15 Dec 2025).

Clustering and classification via CED-barycenter-based kk-means, both stochastic and greedy, achieve superior or competitive results relative to baseline dissimilarities across sea surface height (SSH), VESTEC combustion, and asteroid impact datasets. Notably, for the VESTEC and Asteroid datasets, only CED-based clustering perfectly recovers the true clusters, as measured by misclassification error (Tchitchek et al., 15 Dec 2025).

6. Computational Bottlenecks and Accelerations

The dominant computational cost is the O(NPNQp3)O(N_P N_Q p^3) dynamic program over all subdivision pairs, with each local cost requiring Wasserstein distance calculations. Two prominent accelerations alleviate this bottleneck:

  • Pruning persistence diagrams below a persistence threshold, reducing the diagram size pp.
  • Restriction of the DP table to a Sakoe–Chiba–type temporal corridor of width wmax(NP,NQ)w \ll \max(N_P, N_Q), reducing table size to O((NP+NQ)w)O((N_P+N_Q)w).

Scaling experiments show that barycenter computations scale principlely with the size of the TVPDs and, for greedy descent, also with the product of the number of references and step count (Tchitchek et al., 15 Dec 2025).

7. Connections and Context

The CED framework for TVPDs generalizes classical edit distances between sequences and exponent-strings (Baek, 2024), and extends these concepts to the setting of topological data evolving in time, combining ideas of elastic alignment and Wasserstein geometry. It unifies temporal and topological dissimilarity under a single metric structure, enabling direct analysis of TVPDs in clustering, motif discovery, and averaging without recourse to feature aggregation or time-series vectorization. The design is flexible in the trade-off parameters (α,β)(\alpha, \beta), admits explicit geodesics, and supports scalable barycenter computation, embedding TVPD analysis firmly within the modern metric learning toolkit.

The metric, explicit geodesic structure, and optimization procedures situate CED as a natural extension of edit-distances, analogous to recent developments in Continuous Fréchet Edit Distance for polygonal curves (Fox et al., 2024), but for higher-level topological descriptors over time. Empirical evidence supports its robustness, interpretability, and performance in applied data analysis contexts involving time-evolving shapes and spatial-temporal fields (Tchitchek et al., 15 Dec 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Time-Varying Persistence Diagrams (TVPDs).