Papers
Topics
Authors
Recent
Search
2000 character limit reached

Privacy-Protected Spatial Autoregressive Model

Published 25 Mar 2024 in stat.ME and econ.EM | (2403.16773v2)

Abstract: Spatial autoregressive (SAR) models are important tools for studying network effects. However, with an increasing emphasis on data privacy, data providers often implement privacy protection measures that make classical SAR models inapplicable. In this study, we introduce a privacy-protected SAR model with noise-added response and covariates to meet privacy-protection requirements. However, in this scenario, the traditional quasi-maximum likelihood estimator becomes infeasible because the likelihood function cannot be directly formulated. To address this issue, we first consider an explicit expression for the likelihood function with only noise-added responses. Then, we develop techniques to correct the biases for derivatives introduced by noise. Correspondingly, a Newton-Raphson-type algorithm is proposed to obtain the estimator, leading to a corrected likelihood estimator. To further enhance computational efficiency, we introduce a corrected least squares estimator based on the idea of bias correction. These two estimation methods ensure both data security and the attainment of statistically valid estimators. Theoretical analysis of both estimators is carefully conducted, statistical inference methods and model extensions are discussed. The finite sample performances of different methods are demonstrated through extensive simulations and the analysis of a real dataset.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Anselin, L., Le Gallo, J., and Jayet, H. (2008), “Spatial panel econometrics,” in The econometrics of panel data, Springer, pp. 625–660.
  2. Barabási, A.-L. and Albert, R. (1999), “Emergence of scaling in random networks,” science, 286, 509–512.
  3. Chen, X., Chen, Y., and Xiao, P. (2013), “The impact of sampling and network topology on the estimation of social intercorrelations,” Journal of Marketing Research, 50, 95–110.
  4. Clauset, A., Shalizi, C. R., and Newman, M. E. (2009), “Power-law distributions in empirical data,” SIAM review, 51, 661–703.
  5. Dinur, I. and Nissim, K. (2003), “Revealing information while preserving privacy,” In Proceedings of the Twenty-second ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems,PODS 03, 202–210.
  6. Dwork, C., McSherry, F., Nissim, K., and Smith, A. (2006), “Calibrating Noise to Sensitivity in Private Data Analysis,” In Theory of Cryptography, 265–284.
  7. Hu, J., Reiter, J. P., and Wang, Q. (2018), “Dirichlet process mixture models for modeling and generating synthetic versions of nested categorical data,” Bayesian Analysis, 183–200.
  8. Huang, D., Lan, W., Zhang, H., and Wang, H. (2019), “Least squares estimation of spatial autoregressive models for large-scale social networks,” Electronic Journal of Statistics, 13, 1135–1165.
  9. Huang, D., Wang, F., Zhu, X., and Wang, H. (2020), “Two-mode network autoregressive model for large-scale networks,” Journal of Econometrics, 216, 203–219.
  10. Ito, K., Kawano, Y., and Kashima, K. (2021), “Privacy protection with heavy-tailed noise for linear dynamical systems,” Automatica, 109732.
  11. Lee, L.-f. and Yu, J. (2010), “Estimation of spatial autoregressive panel data models with fixed effects,” Journal of Econometrics, 154, 165–185.
  12. — (2014), “Efficient GMM estimation of spatial dynamic panel data models with fixed effects,” Journal of Econometrics, 180, 174–197.
  13. Li, K. (2017), “Fixed-effects dynamic spatial panel data models and impulse response analysis,” Journal of Econometrics, 198, 102–121.
  14. Malikova, E. and Sun, Y. (2017), “Semiparametric estimation and testing of smooth coefficient spatial autoregressive models,” Journal of Econometrics, 199, 12–34.
  15. Nowicki, K. and Snijders, T. A. B. (2001), “Estimation and prediction for stochastic blockstructures,” Journal of the American statistical association, 96, 1077–1087.
  16. Ord, K. (1975), “Estimation methods for models of spatial interaction,” Journal of the American Statistical Association, 70, 120–126.
  17. Pace, R. K., Barry, R., Gilley, O. W., and Sirmans, C. (2000), “A method for spatial–temporal forecasting with an application to real estate prices,” International Journal of Forecasting, 16, 229–246.
  18. Quick, H. (2021), “Generating Poisson-distributed differentially private synthetic data,” Journal of Royal Statistical Society: Series A (Statistics in Society), 184, 1093–1108.
  19. Quick, H. and Waller, L. A. (2018), “Using spatiotemporal models to generate synthetic data for public use,” Spatial and Spatio-Temporal Epidemiology, 37–45.
  20. Raghunathan, T. E., Reiter, J. P., and Rubin, D. B. (2003), “Multiple imputation for statistical disclosure limitation,” Journal of Official Statistics, 1–16.
  21. Reiter, J. P. (2005), “Using CART to generate partially synthetic, public use microdata,” Journal of Official Statistics, 441–462.
  22. Su, L. (2012), “Semiparametric GMM estimation of spatial autoregressive models,” Journal of Econometrics, 167, 543–560.
  23. Wang, Y. J. and Wong, G. Y. (1987), “Stochastic blockmodels for directed graphs,” Journal of the American Statistical Association, 82, 8–19.
  24. Wilde, H., Jewson, J., Vollmer, S., and Holmes, C. (2021), “Foundations of Bayesian Learning from Synthetic Data,” In International Conference on Artificial Intelligence and Statistics. PMLR, 541–549.
  25. Yang, K. and Lee, L.-f. (2017), “Identification and QML estimation of multivariate and simultaneous equations spatial autoregressive models,” Journal of econometrics, 196, 196–214.
  26. Yu, J., De Jong, R., and Lee, L.-f. (2008), “Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both n and T are large,” Journal of Econometrics, 146, 118–134.
  27. Zhou, J., Tu, Y., Chen, Y., and Wang, H. (2017), “Estimating spatial autocorrelation with sampled network data,” Journal of Business & Economic Statistics, 35, 130–138.
  28. Zhu, X., Huang, D., Pan, R., and Wang, H. (2020), “Multivariate spatial autoregressive model for large scale social networks,” Journal of Econometrics, 215, 591–606.
  29. Zhu, X., Pan, R., Li, G., Liu, Y., Wang, H., et al. (2017), “Network vector autoregression,” The Annals of Statistics, 45, 1096–1123.

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 2 tweets with 0 likes about this paper.