
Continuous-Time Markov Chains (CTMC)

Updated 27 January 2026
  • CTMCs are stochastic processes on discrete state spaces with memoryless, continuous-time transitions defined by an infinitesimal generator matrix.
  • They underpin practical applications in systems reliability, biology, phylogenetics, and queuing, utilizing Kolmogorov equations for dynamic analysis.
  • Advanced inference methods, fluid approximations, and scalable numerical approaches enable robust parameter estimation and efficient analysis of CTMC behavior.

A continuous-time Markov chain (CTMC) is a stochastic process defined on a (finite or countable) discrete state space, evolving by random jumps in continuous time, and characterized by the Markov property: future evolution depends only on the present state, not on the path history. The structure and analysis of CTMCs underpin a substantial domain of probabilistic modeling across system reliability, biology, queuing, phylogenetics, and beyond.

1. Mathematical Foundations and Dynamics

A CTMC $\{X(t): t \geq 0\}$ on a discrete state space $\mathcal{S} = \{1, \dots, m\}$ is completely specified by its infinitesimal generator (rate matrix) $Q = (q_{ij})_{i,j \in \mathcal{S}}$. Off-diagonal entries $q_{ij} \geq 0$ for $i \neq j$ denote the instantaneous transition rates from state $i$ to $j$, while the diagonal is set to $q_{ii} = -\sum_{j \neq i} q_{ij}$. For $h > 0$, the transition probability satisfies

$$P[X(t+h) = j \mid X(t) = i] = \delta_{ij} + q_{ij} h + o(h)$$

where $\delta_{ij}$ is the Kronecker delta. The Kolmogorov forward (master) equation describes the evolution of the probability row vector $p(t)$:

$$\frac{dp(t)}{dt} = p(t) Q, \quad p(0) = p_0$$

The unique solution is

$$p(t) = p(0) \exp(Qt)$$

Trajectory-wise, a CTMC remains in state $x$ for a sojourn time drawn from $\mathrm{Exp}(-q_{xx})$, then jumps to $y \neq x$ with probability $q_{xy}/(-q_{xx})$. For systems such as reaction networks, this formalism naturally extends via generator matrices with structured transitions (e.g., via stoichiometry and propensities) (Reeves et al., 2022).
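The trajectory description above translates directly into a Gillespie-style simulation: draw an exponential sojourn at rate $-q_{xx}$, then jump according to $q_{xy}/(-q_{xx})$. The following minimal sketch (Python with NumPy/SciPy; the 3-state generator is a made-up example, not from any cited paper) checks the empirical law of $X(t)$ against the matrix-exponential solution $p(t) = p(0)\exp(Qt)$:

```python
import numpy as np
from scipy.linalg import expm

# Hypothetical 3-state generator: off-diagonals are rates, rows sum to zero.
Q = np.array([[-2.0,  1.5,  0.5],
              [ 1.0, -1.0,  0.0],
              [ 0.5,  0.5, -1.0]])

def transition_matrix(Q, t):
    """P(t) = exp(Qt); entry (i, j) is P[X(t) = j | X(0) = i]."""
    return expm(Q * t)

def simulate(Q, x0, t_end, rng):
    """Sample X(t_end): exponential sojourns, jump probs q_xy / (-q_xx)."""
    t, x = 0.0, x0
    while True:
        rate = -Q[x, x]
        if rate == 0.0:                      # absorbing state
            return x
        t += rng.exponential(1.0 / rate)     # sojourn ~ Exp(rate)
        if t >= t_end:
            return x
        probs = Q[x].clip(min=0.0) / rate    # jump distribution over y != x
        x = rng.choice(len(Q), p=probs)

rng = np.random.default_rng(0)
t = 0.7
samples = [simulate(Q, 0, t, rng) for _ in range(20000)]
empirical = np.bincount(samples, minlength=3) / len(samples)
exact = transition_matrix(Q, t)[0]           # row for initial state 0
```

With enough trajectories, the empirical frequencies agree with the corresponding row of $\exp(Qt)$ up to Monte Carlo error.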

2. Characterization, Long-Term Behavior, and Stationarity

The stationary distribution $\pi$ of a CTMC is any probability vector satisfying $\pi Q = 0$ and $\sum_x \pi(x) = 1$ (Kuntz et al., 2019). Under boundedness in probability and a Foster–Lyapunov drift criterion, existence and uniqueness of such a $\pi$ are guaranteed without requiring state-space irreducibility:

$$Qv(x) \leq -d_1 v(x) + d_2$$

for a norm-like function $v$ and constants $d_1, d_2 > 0$. In this regime, empirical (time- and space-averaged) measures converge to $\pi$ in total variation, and exponential convergence can be established. The complete decomposition of stationary measures follows the breakdown into closed communicating classes and the exclusion of transient sets (Kuntz et al., 2019).
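For a finite irreducible chain, $\pi$ can be computed numerically by solving $\pi Q = 0$ together with the normalization constraint. A minimal sketch (NumPy; the generator is an illustrative example):

```python
import numpy as np

def stationary_distribution(Q):
    """Solve pi Q = 0 with sum(pi) = 1 by stacking the normalization row."""
    n = Q.shape[0]
    A = np.vstack([Q.T, np.ones(n)])     # (n+1) x n linear system
    b = np.zeros(n + 1)
    b[-1] = 1.0                          # enforce sum(pi) = 1
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

# Hypothetical 3-state generator (irreducible, rows sum to zero).
Q = np.array([[-2.0,  1.5,  0.5],
              [ 1.0, -1.0,  0.0],
              [ 0.5,  0.5, -1.0]])
pi = stationary_distribution(Q)
```

The least-squares solve is exact here because the stacked system is consistent and of full column rank for an irreducible generator.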

Long-term dynamics also include the analysis of recurrence, transience, and explosivity (an infinite expected number of jumps in finite time) in polynomial-rate processes (Xu et al., 2019). The classification invokes summability of the total upward rates $\Lambda_+(n)$ and the product formula for $\pi$. For example, a birth–death chain with birth rate $\lambda_+(x) = \beta x^p$ and death rate $\lambda_-(x) = \delta x^q$ is positive recurrent if $p - q < -1$ and non-explosive if $p \leq 1$.

3. Inference, Learning, and Parameter Estimation

Parameter estimation for CTMCs is essential for practical deployment. When full trajectories (states and dwell times) are observed, maximum likelihood estimation (MLE) is feasible by direct maximization of the complete-data likelihood:

$$L(Q) = \prod_{\text{trajectories}} \prod_i q_{x_i, x_{i+1}} \, e^{q_{x_i x_i} T_i}$$

where $T_i$ is the sojourn in state $x_i$; note that $q_{x_i x_i} \leq 0$, so each holding-time factor decays in $T_i$ (Abo-Elreesh, 2021; Bacci et al., 2023).

For parametric CTMCs (rate entries as polynomial or nonlinear functions of unknowns), minorization–maximization (MM) schemes or expectation–maximization–style surrogates offer efficient iterative estimation. Incomplete data compensation (e.g., label-only or panel data) employs forward–backward recursions and surrogate construction via Jensen's inequality or arithmetic–geometric mean bounds. Surrogate objectives separate in coordinates, yielding coordinate-wise update rules and convergence to local maxima (Bacci et al., 2023).

When transition rates depend nonlinearly on parameters or on external covariates, flexible approaches such as neural-network parameterizations (N-CTMC) or Bayesian nonparametric Gaussian process priors have been proposed, enabling the learning of nonlinear or unknown rate functions from observed data (Reeves et al., 2022; Monti et al., 6 Nov 2025). These frameworks outperform log-linear models in settings involving non-mass-action kinetics or unknown input dependencies.

Bayesian inference for discretely observed CTMCs faces complications due to the intractability of the likelihood (matrix exponentials for long sequences and moderate/large state spaces). Recent advances propose pseudo-likelihoods regularized via spectral decompositions of the generator, leading to efficient and embeddability-aware Gibbs sampling algorithms with strong asymptotic guarantees (posterior consistency, Bernstein–von Mises theorems) (Tang et al., 22 Jul 2025).

4. Fluid and Deterministic Approximations

Classical fluid (or mean-field) limits approximate the trajectory of a high-dimensional or large-population CTMC by deterministic ODEs:

  • Spectral fluid approximation: Decompose QQ via its eigenstructure. For symmetric QQ, trajectory projections onto eigenmodes yield exponential decay in each independent mode. With non-symmetric QQ, truncation to dominant eigen-modes yields efficient low-dimensional surrogates (Michaelides et al., 2019).
  • Diffusion-map–based geometric fluid approximation (GFA): Embeds the CTMC’s state space in a Euclidean manifold via diffusion maps, infers a drift field via Gaussian process regression, and evolves mean trajectories via ODEs. GFA achieves root mean square errors below 5% for classical models and is provably consistent with known fluid limits (Michaelides et al., 2019).

These methods provide deterministic proxies for sample-path statistics, especially useful for systems with many discrete states where exact simulation is prohibitive.
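The spectral idea in the first bullet can be sketched concretely: diagonalize $Q = V \Lambda V^{-1}$, so each eigenmode of $p(t) = p(0)\exp(Qt)$ evolves independently as $e^{\lambda_k t}$, and truncating to the modes with real part closest to zero gives a low-dimensional surrogate. A minimal illustration (NumPy/SciPy; the 3-state generator is a made-up example, and this is a sketch of the general spectral mechanism rather than the full method of the cited paper):

```python
import numpy as np
from scipy.linalg import expm

# Hypothetical 3-state generator with distinct real eigenvalues.
Q = np.array([[-2.0,  1.5,  0.5],
              [ 1.0, -1.0,  0.0],
              [ 0.5,  0.5, -1.0]])

# Diagonalize Q = V diag(lam) V^{-1}; mode k decays as exp(lam_k t).
lam, V = np.linalg.eig(Q)
Vinv = np.linalg.inv(V)

def p_spectral(p0, t, keep=None):
    """Evolve p(t) = p0 exp(Qt) through the eigenbasis, optionally
    truncating to the `keep` slowest modes (a low-dim surrogate)."""
    order = np.argsort(-lam.real)          # slowest (least negative) first
    idx = order[:keep] if keep else order
    modes = (p0 @ V[:, idx]) * np.exp(lam[idx] * t)
    return (modes @ Vinv[idx, :]).real

p0 = np.array([1.0, 0.0, 0.0])
```

With all modes retained this reproduces the matrix exponential exactly; dropping fast modes trades accuracy at short times for dimensionality.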

5. Computational Methods, Scalability, and Numerical Approximations

CTMCs with massive or infinite state spaces require scalable computational approaches:

  • Truncation-based stationary distribution approximation: Finite-state restrictions (with various boundary schemes) allow the solution of finite-dimensional linear or linear-programming systems and provide error bounds via tail-mass control or Lyapunov drift methods (Kuntz et al., 2019).
  • Abstraction and "symblicit" analysis: Combine symbolic state aggregation (OBDDs or other data structures) with explicit numerical solutions in the reduced system. This overcomes state-explosion and rate-explosion, enables accurate transient reward analysis, and supports refinement to achieve prescribed precision (Hahn et al., 2012).
  • Particle-based inference and marginalization: In combinatorial or infinite spaces, Monte Carlo sampling schemes that analytically marginalize out holding times (path-integral based, TIPS algorithm) yield massive variance reductions over classical imputation, making high-dimensional or intractable CTMCs computationally accessible (Hajiaghayi et al., 2013).
  • High-dimensional posterior sampling: Non-reversible samplers, such as the Local Bouncy Particle Sampler (LBPS), leverage parameter sparsity and piecewise-deterministic Markov process features for efficient traversal of large-parameter CTMC posteriors, substantially outperforming classical HMC in high dimensions (Zhao et al., 2019).
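The truncation idea in the first bullet can be illustrated on the M/M/1 queue, an infinite birth–death chain whose exact stationary law is geometric, $\pi_n = (1-\rho)\rho^n$ with $\rho = \lambda/\mu < 1$. A minimal sketch (NumPy; the reflecting-boundary scheme is one common truncation choice, and the error-bounding machinery of the cited work is not reproduced here):

```python
import numpy as np

def mm1_generator_truncated(lam, mu, n_max):
    """Finite truncation of the infinite M/M/1 generator on states
    0..n_max, with a reflecting upper boundary."""
    n = n_max + 1
    Q = np.zeros((n, n))
    for i in range(n - 1):
        Q[i, i + 1] = lam                 # birth (arrival) rate
    for i in range(1, n):
        Q[i, i - 1] = mu                  # death (service) rate
    np.fill_diagonal(Q, -Q.sum(axis=1))
    return Q

lam, mu = 1.0, 2.0                        # rho = 0.5: positive recurrent
Q = mm1_generator_truncated(lam, mu, 60)
n = Q.shape[0]

# Solve pi Q = 0, sum(pi) = 1 on the truncated space.
A = np.vstack([Q.T, np.ones(n)])
b = np.zeros(n + 1)
b[-1] = 1.0
pi, *_ = np.linalg.lstsq(A, b, rcond=None)

rho = lam / mu
exact = (1 - rho) * rho ** np.arange(n)   # untruncated M/M/1 law
```

Because the tail mass beyond the truncation point is of order $\rho^{n_{\max}}$, the truncated solution matches the infinite-chain law to near machine precision here.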

6. Extensions, Mixtures, and Imprecise CTMCs

  • Mixtures of CTMCs: Mixtures allow modeling of populations or behaviors as arising from latent subpopulations, each with its own generator. Learning CTMC mixtures from partially observed (discretized) trajectories requires tailored soft-clustering, SVD, or EM algorithms, with sample complexity governed by mixing time, trail length, and mixture separation (Spaeh et al., 2024).
  • Imprecise CTMCs: When parameter estimates are uncertain or only known within bounds, imprecise CTMCs generalize the rate matrix to a set or interval, leading to lower transition rate operators. Nonlinear differential equations governing the lower expectations admit polynomial-time, guaranteed-error approximations via uniform or adaptive schemes. For ergodic models, stationary lower expectations can be approximated efficiently (Erreygers et al., 2017, Krak et al., 2016).
  • Robust model checking: Logics such as continuous-time linear logic (CLL) enable temporal verification over probability distributions, supporting complex timed and multi-phase properties. CLL model-checking reduces to interval reachability and real-root isolation in polynomial–exponential functions, entailing decidability under Schanuel's Conjecture (Guan et al., 2020).

7. Applications and Empirical Case Studies

CTMCs have been applied extensively:

  • Systems biology and epidemics: Stochastic reaction networks, gene regulatory models, and epidemic models (e.g., SIR under lockdown) have been analyzed using CTMC frameworks and parameter learning via MM, EM, or N-CTMC approaches (Bacci et al., 2023, Reeves et al., 2022).
  • Reliability and queuing: Large-scale performability, file systems, and workstation clusters modeled by CTMCs have been analyzed using abstraction and transient reward approximation, yielding error-bounded estimates for highly complex networks (Hahn et al., 2012).
  • Phylogenetics and evolutionary modeling: CTMCs underlie trait-based phylogeny inference, genome distance estimation, and phylogeography, with inference via maximum likelihood or nonparametric Bayesian GP methods (Amiryousefi, 2021, Monti et al., 6 Nov 2025).
  • Cancer genomics and mutational progression: Cross-sectional mutation accumulation can be modeled as a CTMC over subsets of mutations, with identifiability challenges (underspecification) resolved by augmenting with independent items and scalable learning algorithms capable of handling up to $2^{100}$ states (Gotovos et al., 2021).
  • User behavior and sports analytics: Mixture CTMCs elucidate diverse regimes in user-sequence data and, as in basketball passing, reveal interpretable tactical structures in sequential event streams (Spaeh et al., 2024).

In summary, CTMCs provide a rigorously structured, versatile framework for modeling continuous-time, discrete-state stochastic dynamics. Advances in scalable inference, robust numerical methods, fluid approximations, and flexible model generalizations continue to expand their reach across scientific domains.
