Papers
Topics
Authors
Recent
Search
2000 character limit reached

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Published 23 Feb 2024 in cs.CV, cs.AI, and cs.LG | (2402.15429v2)

Abstract: Text-to-Image (T2I) Diffusion Models (DMs) have shown impressive abilities in generating high-quality images based on simple text descriptions. However, as is common with many Deep Learning (DL) models, DMs are subject to a lack of robustness. While there are attempts to evaluate the robustness of T2I DMs as a binary or worst-case problem, they cannot answer how robust in general the model is whenever an adversarial example (AE) can be found. In this study, we first introduce a probabilistic notion of T2I DMs' robustness; and then establish an efficient framework, ProTIP, to evaluate it with statistical guarantees. The main challenges stem from: i) the high computational cost of the generation process; and ii) determining if a perturbed input is an AE involves comparing two output distributions, which is fundamentally harder compared to other DL tasks like classification where an AE is identified upon misprediction of labels. To tackle the challenges, we employ sequential analysis with efficacy and futility early stopping rules in the statistical testing for identifying AEs, and adaptive concentration inequalities to dynamically determine the "just-right" number of stochastic perturbations whenever the verification target is met. Empirical experiments validate the effectiveness and efficiency of ProTIP over common T2I DMs. Finally, we demonstrate an application of ProTIP to rank commonly used defence methods.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (7)
  1. Midjourney. https://www.midjourney.com/
  2. Aminifar, A.: Universal adversarial perturbations in epileptic seizure detection. In: 2020 International Joint Conference on Neural Networks (IJCNN). pp. 1–6. IEEE (2020)
  3. Fort, S.: Pixels still beat text: attacking the openai clip model with text patches and adversarial pixel perturbations. Stanislav Fort [Internet] 5 (2021)
  4. Hoeffding, W.: Probability inequalities for sums of bounded random variables. The collected works of Wassily Hoeffding pp. 409–426 (1994)
  5. Lakens, D.: Improving Your Statistical Inferences. https://lakens.github.io/statistical_inferences/ (2022)
  6. Norvig, P.: pyspellchecker: A spell checker for python. GitHub repository (2024)
  7. Prithivida: Gramformer: A library for a family of algorithms to detect, highlight and correct grammar errors. GitHub repository (2021)
Citations (2)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.