Synthetic Control Method (SCM)

Updated 5 February 2026

Synthetic Control Method (SCM) is a data-driven causal inference approach that constructs a weighted counterfactual from a pool of control units.
It leverages pre-intervention outcome matching and convex combinations to estimate what would have happened in the absence of treatment.
Extensions of SCM, including augmented methods and dynamic predictor weighting, improve fit diagnostics and address donor selection challenges.

The synthetic control method (SCM) is a data-driven approach for causal inference in comparative case studies, particularly when a single unit (individual, region, organization) is exposed to a treatment or intervention at a specific time, while a pool of other similar units remains untreated. SCM constructs a weighted average of the untreated units (“donors”) to serve as a counterfactual for the treated unit, leveraging longitudinal (panel) data to estimate what would have happened to the treated unit absent the intervention. This framework enables estimation of time-varying treatment effects and facilitates rigorous quantitative policy evaluation in observational settings, especially when randomized experiments are infeasible.

1. Formal Framework and Core Algorithm

SCM estimates the causal effect of an intervention on a single treated unit using a convex combination of donor (control) units matched on pre-intervention outcomes and, optionally, covariates. Let $Y_{it}$ denote the outcome for unit $i$ at time $t$; without loss of generality, unit $i=1$ is treated from period $T_{\mathrm{pre}}+1$ onward, while units $i=2,\dots,J+1$ are controls. The objective is to select weights $w=(w_2,\dots,w_{J+1})$ solving $\min_{w}\, \sum_{t=1}^{T_{\mathrm{pre}}} \left( Y_{1t} - \sum_{j=2}^{J+1} w_j Y_{jt} \right)^2$ subject to $w_j\geq 0$ and $\sum_{j=2}^{J+1} w_j = 1$.

The post-treatment counterfactual is then estimated as $\widehat Y_{1t}(0)=\sum_{j=2}^{J+1} \widehat w_j Y_{jt}$ for $t > T_{\mathrm{pre}}$ (Sun, 26 Oct 2025). SCM can be implemented with arbitrary additional covariates or pre-treatment predictors via an analogous matching objective.

Key properties of this construction:

Convexity: The synthetic control is restricted to the convex hull of control unit outcomes, avoiding extrapolation.
No-interference: Standard SCM assumes no spillover of treatment across units (SUTVA).
Exact or approximate pre-fit: SCM is predicated on achieving minimal imbalance in pre-intervention periods; imperfect pre-fit can lead to bias.
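The constrained least-squares problem above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the function name `fit_scm_weights`, the simulated data, and the choice of SciPy's general-purpose SLSQP solver (in place of a dedicated quadratic-programming routine) are all assumptions made for the example.

```python
import numpy as np
from scipy.optimize import minimize


def fit_scm_weights(y_treated_pre, Y_donors_pre):
    """Least-squares donor weights on the simplex (w_j >= 0, sum_j w_j = 1).

    y_treated_pre: shape (T_pre,), pre-treatment outcomes of the treated unit.
    Y_donors_pre:  shape (T_pre, J), pre-treatment outcomes of the J donors.
    """
    n_donors = Y_donors_pre.shape[1]

    def loss(w):
        resid = y_treated_pre - Y_donors_pre @ w
        return resid @ resid

    def grad(w):
        return -2.0 * Y_donors_pre.T @ (y_treated_pre - Y_donors_pre @ w)

    result = minimize(
        loss,
        x0=np.full(n_donors, 1.0 / n_donors),  # start from equal weights
        jac=grad,
        method="SLSQP",
        bounds=[(0.0, 1.0)] * n_donors,
        constraints=[{"type": "eq", "fun": lambda w: np.sum(w) - 1.0}],
        options={"ftol": 1e-12, "maxiter": 1000},
    )
    w = np.clip(result.x, 0.0, None)  # remove tiny negative round-off
    return w / w.sum()


# Simulated example (hypothetical data): the treated unit is an exact convex
# combination of two donors, plus a constant post-treatment effect of 2.
rng = np.random.default_rng(0)
T_pre, T_post, J = 30, 10, 5
Y_donors = rng.normal(size=(T_pre + T_post, J)).cumsum(axis=0)
true_w = np.array([0.6, 0.4, 0.0, 0.0, 0.0])
y_treated = Y_donors @ true_w
y_treated[T_pre:] += 2.0

w_hat = fit_scm_weights(y_treated[:T_pre], Y_donors[:T_pre])
y_synth = Y_donors @ w_hat                    # synthetic control, all periods
effect = y_treated[T_pre:] - y_synth[T_pre:]  # estimated post-treatment gaps
```

Because the simulated treated unit lies inside the convex hull of the donors, the fitted weights should be close to `true_w` and the estimated gaps close to the simulated effect; with real data the pre-treatment fit is typically only approximate.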

2. Inference, Uncertainty Quantification, and Test Statistics

SCM relies on outcome trajectories for observed donor and treated units. Inference on post-treatment effects is often carried out using permutation (“placebo-in-space”) tests:

RMSPE ratio: Compares the ratio of post-to-pre-treatment root-mean-squared prediction error of the treated unit against the placebo distribution over the donor pool.

$\mathrm{RMSPE}_{\mathrm{pre}} = \sqrt{\frac{1}{T_{\mathrm{pre}}}\sum_{t=1}^{T_{\mathrm{pre}}} (Y_{1t} - \widehat Y_{1t})^2},\quad \mathrm{RMSPE}_{\mathrm{post}} = \sqrt{\frac{1}{T_{\mathrm{post}}}\sum_{t=T_{\mathrm{pre}}+1}^{T_{\mathrm{pre}}+T_{\mathrm{post}}} (Y_{1t} - \widehat Y_{1t})^2}$

Post-treatment gap: The mean deviation between observed and synthetic outcomes after treatment, tested against the corresponding placebo distribution.

Permutation $p$-values are calculated as the empirical fraction of control units whose test statistic is more extreme than that of the treated unit (Sun, 26 Oct 2025). For example, in a real-world application, the RMSPE-ratio test gave $p = 0.0508$, which is significant at the 10% level but falls just short of the 5% level, whereas the post-treatment gap was not statistically significant at conventional levels ($p \approx 0.32$).
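A placebo-in-space test based on the RMSPE ratio might be sketched as follows. The helper names, the simulated panel, and two conventions are assumptions of this example: the truly treated unit is excluded from every placebo donor pool, and the $p$-value counts the treated unit itself, so the smallest attainable value is $1/(J+1)$.

```python
import numpy as np
from scipy.optimize import minimize


def fit_weights(y_pre, D_pre):
    """Simplex-constrained least-squares weights (SLSQP used for illustration)."""
    J = D_pre.shape[1]
    res = minimize(
        lambda w: np.sum((y_pre - D_pre @ w) ** 2),
        x0=np.full(J, 1.0 / J),
        jac=lambda w: -2.0 * D_pre.T @ (y_pre - D_pre @ w),
        method="SLSQP",
        bounds=[(0.0, 1.0)] * J,
        constraints=[{"type": "eq", "fun": lambda w: np.sum(w) - 1.0}],
        options={"ftol": 1e-12, "maxiter": 1000},
    )
    w = np.clip(res.x, 0.0, None)
    return w / w.sum()


def rmspe_ratio(y, D, T_pre):
    """Post/pre RMSPE ratio for one unit y against donor matrix D."""
    w = fit_weights(y[:T_pre], D[:T_pre])
    gap = y - D @ w
    rmspe_pre = np.sqrt(np.mean(gap[:T_pre] ** 2))
    rmspe_post = np.sqrt(np.mean(gap[T_pre:] ** 2))
    return rmspe_post / rmspe_pre


def placebo_p_value(Y, T_pre, treated=0):
    """Y has shape (T, J+1); column `treated` holds the treated unit."""
    n_units = Y.shape[1]
    ratios = np.empty(n_units)
    for i in range(n_units):
        donor_mask = np.ones(n_units, dtype=bool)
        donor_mask[[i, treated]] = False  # drop unit i and the treated unit
        ratios[i] = rmspe_ratio(Y[:, i], Y[:, donor_mask], T_pre)
    # Share of units (treated included) with a ratio at least as large.
    p_value = np.mean(ratios >= ratios[treated])
    return p_value, ratios


# Simulated panel (hypothetical): common trend, unit loadings, noise, and a
# large post-treatment shift of 5 for the treated unit in column 0.
rng = np.random.default_rng(1)
T_pre, T_post, n_units = 30, 10, 11
common = rng.normal(size=T_pre + T_post).cumsum()
loadings = rng.uniform(0.5, 1.5, size=n_units)
loadings[0] = 1.0
Y = np.outer(common, loadings) + rng.normal(scale=0.3, size=(T_pre + T_post, n_units))
Y[T_pre:, 0] += 5.0

p_value, ratios = placebo_p_value(Y, T_pre, treated=0)
```

With only $J$ donors the $p$-value moves in steps of $1/(J+1)$, so small donor pools limit the significance levels that can be reached at all.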

3. Practical Implementation and Tuning

SCM requires several choices regarding donor selection, predictor sets, and implementation details:

Donor pool: Units that resemble the treated unit on characteristics affecting the pre-treatment trend and that have complete outcome data. Practical implementations often prune donors with missing data or with low pre-treatment correlation with the treated unit's trajectory (Sun, 26 Oct 2025).
Predictors and their weights: Standard SCM allows inclusion of non-outcome covariates (e.g., demographic, economic, structural features). Weighting matrices (denoted $V$) can emphasize more predictive pre-treatment periods or covariates. In the cited Altadena wildfire study, only lagged outcomes were used and pre-treatment periods were exponentially downweighted to emphasize recent months.
Weight computation: The optimal $w$ is computed by quadratic programming. Modern implementations often use efficient convex solvers.
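The three choices above (donor pruning, period weighting, and weight computation) can be combined in a short sketch. The correlation threshold `min_corr=0.5`, the decay rate `decay=0.9`, the function names, and the simulated data are illustrative assumptions, not values taken from the cited study; SLSQP again stands in for a dedicated quadratic-programming solver.

```python
import numpy as np
from scipy.optimize import minimize


def prune_donors(y_pre, D_pre, min_corr=0.5):
    """Indices of donors with complete data and sufficient pre-period correlation."""
    keep = []
    for j in range(D_pre.shape[1]):
        col = D_pre[:, j]
        if np.isnan(col).any():
            continue  # drop donors with missing outcomes
        if np.corrcoef(y_pre, col)[0, 1] >= min_corr:
            keep.append(j)
    return np.array(keep, dtype=int)


def exponential_time_weights(T_pre, decay=0.9):
    """Period weights v_t proportional to decay**(T_pre - t), normalized to sum to 1."""
    v = decay ** np.arange(T_pre - 1, -1, -1)  # most recent period weighted most
    return v / v.sum()


def fit_weighted_scm(y_pre, D_pre, v):
    """Minimize sum_t v_t * (y_t - sum_j w_j D_tj)^2 over the simplex."""
    s = np.sqrt(v)
    ys, Ds = y_pre * s, D_pre * s[:, None]  # row scaling turns it into plain LS
    J = Ds.shape[1]
    res = minimize(
        lambda w: np.sum((ys - Ds @ w) ** 2),
        x0=np.full(J, 1.0 / J),
        jac=lambda w: -2.0 * Ds.T @ (ys - Ds @ w),
        method="SLSQP",
        bounds=[(0.0, 1.0)] * J,
        constraints=[{"type": "eq", "fun": lambda w: np.sum(w) - 1.0}],
        options={"ftol": 1e-12, "maxiter": 1000},
    )
    w = np.clip(res.x, 0.0, None)
    return w / w.sum()


# Hypothetical pre-treatment data: two usable donors, one donor that trends
# in the opposite direction, and one donor with a missing observation.
rng = np.random.default_rng(2)
t = np.arange(24.0)
y_pre = 100 + 2 * t + rng.normal(scale=0.5, size=t.size)
D_pre = np.column_stack([
    90 + 2 * t + rng.normal(scale=0.5, size=t.size),
    110 + 2 * t + rng.normal(scale=0.5, size=t.size),
    100 - 2 * t,
    100 + 2 * t,
])
D_pre[3, 3] = np.nan

keep = prune_donors(y_pre, D_pre)
v = exponential_time_weights(len(t))
w = fit_weighted_scm(y_pre, D_pre[:, keep], v)
```

Scaling each period by $\sqrt{v_t}$ reduces the weighted objective to the unweighted one, so the same solver can be reused without modification.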

Fit diagnostics such as pre-treatment RMSPE indicate the quality of the synthetic control approximation. In high-quality SCM applications (e.g., Altadena), pre-intervention RMSPE can be as low as 0.61% of the treated unit's mean value.

4. Extensions and Variants

A range of SCM extensions address methodological challenges:

Augmented SCM: Incorporates bias correction when exact pre-fit is unachievable via outcome modeling (e.g., penalized regression) (Ben-Michael et al., 2018).
Penalized, Model-Averaged, and Covariate-balanced SCM: Regularization and model averaging mitigate overfitting and improve risk properties when the number of controls is large or pre-treatment fit is poor (Pouliot et al., 2022).
Staggered adoption: The framework can be generalized to settings where units receive treatment at different times, requiring partially pooled weights that minimize imbalance both for each unit individually and for the pooled treated average (Ben-Michael et al., 2019).
Dynamic predictor weighting: Exponential or time-varying weights can emphasize recent outcomes or particularly informative covariates (Sun, 26 Oct 2025).

These variants retain the core structure of SCM, using a convex combination of controls as the counterfactual, but relax or adapt various modeling constraints and balance objectives.
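As an illustration of the bias correction mentioned for Augmented SCM, one common form uses a ridge outcome model. The notation here is an assumption layered on the symbols of Section 1: $X_j = (Y_{j1},\dots,Y_{jT_{\mathrm{pre}}})^{\top}$ is the vector of pre-treatment outcomes for unit $j$, and $\widehat\eta_t$ denotes ridge coefficients from regressing the control units' period-$t$ outcomes on their $X_j$. For $t > T_{\mathrm{pre}}$:

$\widehat Y^{\mathrm{aug}}_{1t}(0) = \sum_{j=2}^{J+1} \widehat w_j Y_{jt} + \left( X_1 - \sum_{j=2}^{J+1} \widehat w_j X_j \right)^{\top} \widehat\eta_t$

The second term adjusts the SCM estimate by the modeled consequence of the remaining pre-treatment imbalance. When the pre-treatment fit is exact, $X_1 = \sum_{j} \widehat w_j X_j$, the correction vanishes, and the estimate reduces to the standard synthetic control.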

5. Limitations, Identification, and Statistical Properties

The validity of SCM estimates is contingent on several identification and modeling assumptions:

**Conv


References

Sun (2025). Wildfire and house prices: A synthetic control case study of Altadena (Jan 2025).

Ben-Michael et al. (2018). The Augmented Synthetic Control Method.

Pouliot et al. (2022). Degrees of Freedom and Information Criteria for the Synthetic Control Method.

Ben-Michael et al. (2019). Synthetic Controls with Staggered Adoption.
