Papers
Topics
Authors
Recent
Search
2000 character limit reached

Large-Sample Properties of the Synthetic Control Method under Selection on Unobservables

Published 22 Nov 2023 in econ.EM | (2311.13575v2)

Abstract: We analyze the synthetic control (SC) method in panel data settings with many units. We assume the treatment assignment is based on unobserved heterogeneity and pre-treatment information, allowing for both strictly and sequentially exogenous assignment processes. We show that the critical property that determines the behavior of the SC method is the ability of input features to approximate the unobserved heterogeneity. Our results imply that the SC method delivers asymptotically normal estimators for a large class of linear panel data models as long as the number of pre-treatment periods is sufficiently large, making it a natural alternative to the Difference-in-Differences.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (80)
  1. Alberto Abadie. Semiparametric difference-in-differences estimators. The Review of Economic Studies, 72(1):1–19, 2005.
  2. Alberto Abadie. Using synthetic controls: Feasibility, data requirements, and methodological aspects. Journal of Economic Literature, 59(2):391–425, 2021.
  3. The economic costs of conflict: A case study of the basque country. American Economic Review, 93(-):113–132, 2003.
  4. A penalized synthetic control estimator for disaggregated data. Journal of the American Statistical Association, 116(536):1817–1834, 2021.
  5. Synthetic control methods for comparative case studies: Estimating the effect of California’s tobacco control program. Journal of the American Statistical Association, 105(490):493–505, 2010.
  6. Sampling-based versus design-based uncertainty in regression analysis. Econometrica, 88(1):265–296, 2020.
  7. Jaap H Abbring and Gerard J Van den Berg. The nonparametric identification of treatment effects in duration models. Econometrica, 71(5):1491–1517, 2003.
  8. Democracy does cause growth. Journal of political economy, 127(1):47–100, 2019.
  9. The time series and cross-section asymptotics of dynamic panel data estimators. Econometrica, 71(4):1121–1159, 2003.
  10. Julius J Andersson. Carbon taxes and co 2 emissions: Sweden as a case study. American Economic Journal: Economic Policy, 11(4):1–30, 2019.
  11. Identification of and correction for publication bias. American Economic Review, 109(8):2766–2794, 2019.
  12. Manuel Arellano. Panel data econometrics. Oxford university press, 2003.
  13. Doubly robust identification for causal panel data models. The Econometrics Journal, 25(3):649–674, 2022.
  14. Synthetic difference-in-differences. American Economic Review, 111(12):4088–4118, 2021.
  15. Finite-sample optimal estimation and inference on average treatment effects under unconfoundedness. Econometrica, 89(3):1141–1177, 2021.
  16. Orley Ashenfelter. Estimating the effect of training programs on earnings. The Review of Economics and Statistics, pages 47–57, 1978.
  17. Using the longitudinal structure of earnings to estimate the effect of training programs. The Review of Economics and Statistics, 67(4):648–660, 1985.
  18. Approximate residual balancing: debiased inference of average treatment effects in high dimensions. Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(4):597–623, 2018.
  19. Jushan Bai. Inferential theory for factor models of large dimensions. Econometrica, 71(1):135–171, 2003.
  20. Jushan Bai. Panel data models with interactive fixed effects. Econometrica, 77(4):1229–1279, 2009.
  21. High-dimensional methods and inference on structural and treatment effects. Journal of Economic Perspectives, 28(2):29–50, 2014.
  22. The balancing act in causal inference. arXiv preprint arXiv:2110.14831, 2021a.
  23. The augmented synthetic control method. Journal of the American Statistical Association, 116(536):1789–1803, 2021b.
  24. Synthetic controls with staggered adoption. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(2):351–381, 2022.
  25. Factor and factor loading augmented estimators for panel regression with possibly nonstrong factors. Journal of Business & Economic Statistics, 41(1):270–281, 2022.
  26. Initial conditions and moment restrictions in dynamic panel data models. Journal of econometrics, 87(1):115–143, 1998.
  27. Market share, market value and innovation in a panel of british manufacturing firms. The review of economic studies, 66(3):529–554, 1999.
  28. Individual effects and dynamics in count data models. Journal of econometrics, 108(1):113–131, 2002.
  29. Discretizing unobserved heterogeneity. Econometrica, 90(2):625–643, 2022.
  30. Revisiting event study designs: Robust and efficient estimation. arXiv preprint arXiv:2108.12419, 2021.
  31. Difference-in-differences with multiple time periods. Journal of Econometrics, 225(2):200–230, 2021.
  32. Prediction intervals for synthetic control methods. Journal of the American Statistical Association, 116(536):1865–1880, 2021.
  33. Uncertainty quantification in synthetic controls with staggered treatment adoption. arXiv preprint arXiv:2210.05026, 2022.
  34. Catastrophic natural disasters and economic growth. Review of Economics and Statistics, 95(5):1549–1561, 2013.
  35. Gary Chamberlain. Multivariate regression models for panel data. Journal of econometrics, 18(1):5–46, 1982.
  36. Gary Chamberlain. Panel data. Handbook of econometrics, 2:1247–1318, 1984.
  37. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68, 2018.
  38. An exact and robust conformal inference method for counterfactual and synthetic controls. Journal of the American Statistical Association, 116(536):1849–1864, 2021.
  39. Technology and big data are changing economics: Mining text to track methods. In AEA Papers and Proceedings, volume 110, pages 42–48. American Economic Association 2014 Broadway, Suite 305, Nashville, TN 37203, 2020.
  40. Two-way fixed effects estimators with heterogeneous treatment effects. American Economic Review, 110(9):2964–2996, 2020.
  41. Balancing, regression, difference-in-differences and synthetic control methods: A synthesis. Technical report, National Bureau of Economic Research, 2016.
  42. Synthetic controls with imperfect pretreatment fit. Quantitative Economics, 12(4):1197–1221, 2021.
  43. Linear panel regressions with two-way unobserved heterogeneity. Journal of Econometrics, 237(1):105498, 2023.
  44. Selection and parallel trends. arXiv preprint arXiv:2203.09001, 2022.
  45. Mathematical foundations of infinite-dimensional statistical models. Cambridge university press, 2021.
  46. Andrew Goodman-Bacon. Difference-in-differences with variation in treatment timing. Journal of Econometrics, 225(2):254–277, 2021.
  47. Inverse probability tilting for moment condition models with missing data. The Review of Economic Studies, 79(3):1053–1079, 2012.
  48. Asymptotically unbiased inference for a dynamic panel model with fixed effects when both n and t are large. Econometrica, 70(4):1639–1657, 2002.
  49. Jens Hainmueller. Entropy balancing for causal effects: A multivariate reweighting method to produce balanced samples in observational studies. Political analysis, 20(1):25–46, 2012.
  50. Dynamic discrete choice and dynamic treatment effects. Journal of Econometrics, 136(2):341–396, 2007.
  51. Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 71(4):1161–1189, 2003.
  52. Augmented minimax linear estimation. The Annals of Statistics, 49(6):3206–3227, 2021.
  53. Estimating vector autoregressions with panel data. Econometrica: Journal of the econometric society, pages 1371–1395, 1988.
  54. Covariate balancing propensity score. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 76(1):243–263, 2014.
  55. Causal Inference in Statistics, Social, and Biomedical Sciences. Cambridge University Press, 2015.
  56. Information theoretic approaches to inference in moment condition models. Econometrica, 66(2):333–357, 1998.
  57. The labor market impacts of universal and permanent cash transfers: Evidence from the alaska permanent fund. American Economic Journal: Economic Policy, 14(2):315–340, 2022.
  58. Learning subgaussian classes: Upper and minimax bounds. arXiv preprint arXiv:1305.4825, 2013.
  59. Shahar Mendelson. Learning without concentration. In Conference on Learning Theory, pages 25–39, 2014.
  60. Shahar Mendelson. Learning without concentration. Journal of the ACM (JACM), 62(3):1–25, 2015.
  61. Shahar Mendelson. Upper bounds on product and multiplier empirical processes. Stochastic Processes and their Applications, 126(12):3652–3680, 2016.
  62. Face masks considerably reduce covid-19 cases in germany. Proceedings of the National Academy of Sciences, 117(51):32293–32301, 2020.
  63. Linear regression for panel with unknown number of factors as interactive fixed effects. Econometrica, 83(4):1543–1579, 2015.
  64. Dynamic linear panel regression models with interactive fixed effects. Econometric Theory, 33(1):158–195, 2017.
  65. Jerzey Neyman. On the application of probability theory to agricultural experiments. essay on principles. section 9. Statistical Science, 5(4):465–472, 1923/1990.
  66. Stephen Nickell. Biases in dynamic models with fixed effects. Econometrica: Journal of the econometric society, pages 1417–1426, 1981.
  67. Design-based uncertainty for quasi-experiments. arXiv preprint arXiv:2008.00602, 2020.
  68. A more credible approach to parallel trends. Review of Economic Studies, page rdad018, 2023.
  69. Marginal structural models and causal inference in epidemiology. Epidemiology, pages 550–560, 2000.
  70. Jonathan Roth. Pretest with caution: Event-study estimates after testing for parallel trends. American Economic Review: Insights, 4(3):305–322, 2022.
  71. Donald B Rubin. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688–701, 1974.
  72. Susanne M Schennach. Point estimation with exponentially tilted empirical likelihood. The Annals of Statistics, pages 634–672, 2007.
  73. Estimating dynamic treatment effects in event studies with heterogeneous treatment effects. Journal of Econometrics, 225(2):175–199, 2021.
  74. Zhiqiang Tan. Model-assisted inference for treatment effects using regularized calibrated estimation with high-dimensional data. The Annals of Statistics, 48(2):811–837, 2020.
  75. Roman Vershynin. High-dimensional probability: An introduction with applications in data science, volume 47. Cambridge university press, 2018.
  76. Dynamic covariate balancing: estimating treatment effects over time. arXiv preprint arXiv:2103.01280, 2021.
  77. Minimal dispersion approximately balancing weights: asymptotic properties and practical considerations. Biometrika, 107(1):93–105, 2020.
  78. Jeffrey M Wooldridge. Econometric analysis of cross section and panel data. MIT press, 2010.
  79. Optimal experimental design for staggered rollouts. Management Science, 2023.
  80. José R Zubizarreta. Stable weights that balance covariates for estimation with incomplete outcome data. Journal of the American Statistical Association, 110(511):910–922, 2015.
Citations (5)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 3 tweets with 31 likes about this paper.