Training Differentially Private Ad Prediction Models with Semi-Sensitive Features
Abstract: Motivated by problems arising in digital advertising, we introduce the task of training differentially private (DP) machine learning models with semi-sensitive features. In this setting, a subset of the features is known to the attacker (and thus need not be protected) while the remaining features as well as the label are unknown to the attacker and should be protected by the DP guarantee. This task interpolates between training the model with full DP (where the label and all features should be protected) or with label DP (where all the features are considered known, and only the label should be protected). We present a new algorithm for training DP models with semi-sensitive features. Through an empirical evaluation on real ads datasets, we demonstrate that our algorithm surpasses in utility the baselines of (i) DP stochastic gradient descent (DP-SGD) run on all features (known and unknown), and (ii) a label DP algorithm run only on the known features (while discarding the unknown ones).
- Deep learning with differential privacy. In CCS, 308–318.
- Android. 2023a. Attribution Reporting. https://developer.android.com/design-for-safety/privacy-sandbox/attribution.
- Android. 2023b. Protected Audience API on Android developer guide. Https://developer.android.com/design-for-safety/privacy-sandbox/guides/protected-audience.
- Sample complexity bounds for differentially private learning. In COLT, 155–186.
- Private Ad Modeling with DP-SGD. In AdKDD.
- Attribution modeling increases efficiency of bidding in display advertising. In AdKDD, 1–6.
- Protected Audience API: On-device ad auctions to serve remarketing and custom audiences, without cross-site third-party tracking. https://developer.chrome.com/docs/privacy-sandbox/protected-audience.
- Our Data, Ourselves: Privacy Via Distributed Noise Generation. In EUROCRYPT, 486–503.
- Calibrating Noise to Sensitivity in Private Data Analysis. In TCC, 265–284.
- The algorithmic foundations of differential privacy. Foundations and Trends® in Theoretical Computer Science, 9(3–4): 211–407.
- Deep learning with label differential privacy. NeurIPS, 27131–27145.
- Display Advertising Challenge.
- Private Learning with Public Features. arXiv preprint arXiv:2310.15454.
- SGDR: Stochastic gradient descent with warm restarts. In ICLR.
- Antipodes of label differential privacy: PATE and ALIBI. NeurIPS, 34: 6934–6945.
- Microsoft. 2021. MaskedLARk. https://github.com/microsoft/maskedlark.
- Mironov, I. 2017. Rényi Differential Privacy. In CSF, 263–275.
- Attribution Reporting. https://developer.chrome.com/en/docs/privacy-sandbox/attribution-reporting/.
- Masked LARk: Masked learning, aggregation and reporting workflow. arXiv preprint arXiv:2110.14794.
- Schuh, J. 2020. Building a more private web: A path towards making third party cookies obsolete. https://blog.chromium.org/2020/01/building-more-private-web-path-towards.html.
- Classification with Partially Private Features. arXiv preprint arXiv:2312.07583.
- Thomson, M. 2022. Privacy Preserving Attribution for Advertising. https://blog.mozilla.org/en/mozilla/privacy-preserving-attribution-for-advertising/.
- Vadhan, S. P. 2017. The Complexity of Differential Privacy. In Tutorials on the Foundations of Cryptography, 347–450. Springer International Publishing.
- Warner, S. L. 1965. Randomized response: a survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60 309: 63–69.
- Wilander, J. 2020. Full Third-Party Cookie Blocking and More. https://webkit.org/blog/10218/full-third-party-cookie-blocking-and-more/.
- Wilander, J. 2021. Introducing Private Click Measurement, PCM. https://webkit.org/blog/11529/introducing-private-click-measurement-pcm/.
- Winstrom, L. 2023. A proposal for privacy preserving ad attribution measurement using Prio-like architecture. https://github.com/patcg/proposals/issues/17.
- Wood, M. 2019. Today’s Firefox Blocks Third-Party Tracking Cookies and Cryptomining by Default. https://blog.mozilla.org/en/products/firefox/todays-firefox-blocks-third-party-tracking-cookies-and-cryptomining-by-default/.
- Adaptive methods for nonconvex optimization. In NIPS.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.