Practical Privacy-Preserving Gaussian Process Regression via Secret Sharing
Abstract: Gaussian process regression (GPR) is a non-parametric model that has been used in many real-world applications that involve sensitive personal data (e.g., healthcare, finance, etc.) from multiple data owners. To fully and securely exploit the value of different data sources, this paper proposes a privacy-preserving GPR method based on secret sharing (SS), a secure multi-party computation (SMPC) technique. In contrast to existing studies that protect the data privacy of GPR via homomorphic encryption, differential privacy, or federated learning, our proposed method is more practical and can be used to preserve the data privacy of both the model inputs and outputs for various data-sharing scenarios (e.g., horizontally/vertically-partitioned data). However, it is non-trivial to directly apply SS on the conventional GPR algorithm, as it includes some operations whose accuracy and/or efficiency have not been well-enhanced in the current SMPC protocol. To address this issue, we derive a new SS-based exponentiation operation through the idea of 'confusion-correction' and construct an SS-based matrix inversion algorithm based on Cholesky decomposition. More importantly, we theoretically analyze the communication cost and the security of the proposed SS-based operations. Empirical results show that our proposed method can achieve reasonable accuracy and efficiency under the premise of preserving data privacy.
- Deep learning with differential privacy. In Proc. of the ACM SIGSAC conference on computer and communications security, pages 308–318, 2016.
- Secure computation on floating point numbers. Cryptology ePrint Archive, 2012.
- Donald Beaver. Efficient multiparty protocols using circuit randomization. In Annual International Cryptology Conference, pages 420–432. Springer, 1991.
- George Robert Blakley. Safeguarding cryptographic keys. In International Workshop on Managing Requirements Knowledge, 1979.
- Ran Canetti. Universally composable security: A new paradigm for cryptographic protocols. In Proc. IEEE Symposium on Foundations of Computer Science, pages 136–145, 2001.
- Gaussian process decentralized data fusion and active sensing for spatiotemporal traffic modeling and prediction in mobility-on-demand systems. IEEE Transactions on Automation Science and Engineering, 12(3):901–921, 2015.
- Federated Bayesian optimization via Thompson sampling. In Proc. NeurIPS, pages 9687–9699, 2020.
- Differentially private federated Bayesian optimization with distributed exploration. In Proc. NeurIPS, 2021.
- Cynthia Dwork. Differential privacy. In Proc. ICALP, pages 1–12, 2006.
- The algorithmic foundations of differential privacy. Found. Trends Theor. Comput. Sci., 9(3-4):211–407, 2014.
- Least angle regression. The Annals of statistics, 32(2):407–499, 2004.
- A pragmatic introduction to secure multi-party computation. Foundations and Trends® in Privacy and Security, 2(2-3):70–246, 2018.
- Privacy-preserving Gaussian process regression – A modular approach to the application of homomorphic encryption. In Proc. AAAI, pages 3866–3873, 2020.
- Craig Gentry. Fully homomorphic encryption using ideal lattices. In Proc. annual ACM symposium on Theory of computing, pages 169–178, 2009.
- Shafi Goldwasser. How to play any mental game, or a completeness theorem for protocols with an honest majority. In Proc. ACM STOC, pages 218–229, 1987.
- Secure Poisson regression. In Proc. USENIX Security, pages 791–808, 2022.
- Private outsourced Bayesian optimization. In Proc. ICML, pages 5231–5242, 2020.
- CrypTen: Secure multi-party computation meets machine learning. In Proc. NeurIPS, 2021.
- Federated learning: Strategies for improving communication efficiency. arXiv preprint arXiv:1610.05492, 2016.
- Fully decentralized, scalable Gaussian processes for multi-agent federated learning. arXiv preprint arXiv:2203.02865, 2022.
- Oblivious neural network predictions via MiniONN transformations. In Proc. CCS, pages 619–631, 2017.
- ABY3: A mixed protocol framework for machine learning. In Proc. CCS, pages 35–52, 2018.
- SecureML: A system for scalable privacy-preserving machine learning. In IEEE Symposium on Security and Privacy, pages 19–38, 2017.
- Automated insulin delivery for type 1 diabetes mellitus patients using Gaussian process-based model predictive control. In Proc. ACC, pages 4118–4123, 2019.
- Gaussian processes for machine learning. MIT Press, 2006.
- AriaNN: Low-interaction privacy-preserving deep learning via function secret sharing. Proc. on Privacy Enhancing Technologies, 2022(1):291–316, 2020.
- Adi Shamir. How to share a secret. Communications of the ACM, 22(11):612–613, 1979.
- Gaussian process-based kernel as a diagnostic model for prediction of type 2 diabetes mellitus risk using non-linear heart rate variability features. Biomedical Engineering Letters, 11(3):273–286, 2021.
- Differentially private regression with Gaussian processes. In Proc. AISTATS, pages 1195–1203, 2018.
- SecureNN: 3-Party secure computation for neural network training. Proc. on Privacy Enhancing Technologies, 2019(3):26–49, 2019.
- Bayesian nonparametric models for multiway data analysis. IEEE transactions on pattern analysis and machine intelligence, 37(2):475–487, 2015.
- Sparse matrix-variate Gaussian process blockmodels for network modeling. In Proc. UAI, pages 745–752, 2011.
- Gaussian process-based algorithmic trading strategy identification. Quantitative Finance, 15(10):1683–1703, 2015.
- Andrew Chi-Chih Yao. Protocols for secure computations. In Annual Symposium on Foundations of Computer Science, pages 160–164, 1982.
- Andrew Chi-Chih Yao. How to generate and exchange secrets. In Annual Symposium on Foundations of Computer Science, pages 162–167, 1986.
- A comprehensive survey of privacy-preserving federated learning: A taxonomy, review, and future directions. ACM Computing Surveys (CSUR), 54(6):1–36, 2021.
- Federated Gaussian process: Convergence, automatic personalization and multi-fidelity modeling. arXiv preprint arXiv:2111.14008, 2021.
- No free lunch theorem for security and utility in federated learning. ACM Transactions on Intelligent Systems and Technology, 14(1):1–35, 2022.
- Near-optimal active learning of multi-output Gaussian processes. In Proc. AAAI, pages 2351–2357, 2016.
- iDLG: Improved deep leakage from gradients. arXiv preprint arXiv:2001.02610, 2020.
- Scalable nonparametric multiway data analysis. In Guy Lebanon and S. V. N. Vishwanathan, editors, Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2015, San Diego, California, USA, May 9-12, 2015, 2015.
- DinTucker: Scaling up Gaussian process models on large multidimensional arrays. In Proc. AAAI, 2016.
- Deep leakage from gradients. In Proc. NeurIPS, 2019.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.