OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
Abstract: Text-to-image generative models are becoming increasingly popular and accessible to the general public. As these models see large-scale deployments, it is necessary to deeply investigate their safety and fairness to not disseminate and perpetuate any kind of biases. However, existing works focus on detecting closed sets of biases defined a priori, limiting the studies to well-known concepts. In this paper, we tackle the challenge of open-set bias detection in text-to-image generative models presenting OpenBias, a new pipeline that identifies and quantifies the severity of biases agnostically, without access to any precompiled set. OpenBias has three stages. In the first phase, we leverage a LLM to propose biases given a set of captions. Secondly, the target generative model produces images using the same set of captions. Lastly, a Vision Question Answering model recognizes the presence and extent of the previously proposed biases. We study the behavior of Stable Diffusion 1.5, 2, and XL emphasizing new biases, never investigated before. Via quantitative experiments, we demonstrate that OpenBias agrees with current closed-set bias detection methods and human judgement.
- Does data repair lead to fair models? curating contextually fair data to reduce model bias. In WACV, 2022.
- Spatext: Spatio-textual representation for controllable image generation. In CVPR, 2023.
- Easily accessible text-to-image generation amplifies demographic stereotypes at large scale. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, 2023.
- Man is to computer programmer as woman is to homemaker? debiasing word embeddings. In NeurIPS, 2016.
- On the opportunities and risks of foundation models. arXiv preprint, 2021.
- Sega: Instructing text-to-image models using semantic guidance. In NeurIPS, 2023a.
- Mitigating inappropriateness in image generation: Can there be value in reflecting the world’s ugliness? arXiv preprint, 2023b.
- Instructpix2pix: Learning to follow image editing instructions. In CVPR, 2023.
- Language models are few-shot learners. In NeurIPS, 2020.
- Emerging properties in self-supervised vision transformers. In ICCV, 2021.
- Video chatcaptioner: Towards the enriched spatiotemporal descriptions. arXiv preprint, 2023.
- Reproducible scaling laws for contrastive language-image learning. In CVPR, 2023.
- Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models. In ICCV, 2023.
- Improving fairness using vision-language driven image augmentation. In WACV, 2024.
- An image is worth 16x16 words: Transformers for image recognition at scale. In ICLR, 2021.
- Diffusion self-guidance for controllable image generation. In NeurIPS, 2023.
- Fair diffusion: Instructing text-to-image generation models on fairness. arXiv preprint, 2023.
- An image is worth one word: Personalizing text-to-image generation using textual inversion. arXiv preprint, 2022.
- Bias and fairness in large language models: A survey. arXiv preprint, 2023.
- Unified concept editing in diffusion models. In WACV, 2024.
- Pair-diffusion: A comprehensive multimodal object-level image editor. arXiv preprint, 2023.
- Visual programming: Compositional visual reasoning without training. In CVPR, 2023.
- Women also snowboard: Overcoming bias in captioning models. In ECCV, 2018.
- Prompt-to-prompt image editing with cross attention control. In ICLR, 2022.
- Classifier-free diffusion guidance. In NeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications, 2021.
- Promptcap: Prompt-guided image captioning for vqa with gpt-3. In ICCV, 2023a.
- Tifa: Accurate and interpretable text-to-image faithfulness evaluation with question answering. In ICCV, 2023b.
- Composer: Creative and controllable image synthesis with composable conditions. In ICML, 2023.
- Learning fair classifiers with partially annotated group labels. In CVPR, 2022.
- Fairface: Face attribute dataset for balanced race, gender, and age for bias measurement and mitigation. In WACV, 2021.
- A style-based generator architecture for generative adversarial networks. In CVPR, 2019.
- Alias-free generative adversarial networks. In NeurIPS, 2021.
- Repfair-gan: Mitigating representation bias in gans using gradient clipping. arXiv preprint, 2022.
- Vilt: Vision-and-language transformer without convolution or region supervision. In ICML, 2021.
- Bias-to-text: Debiasing unknown visual biases through language interpretation. arXiv preprint, 2023.
- The india face set: International and cultural boundaries impact face impressions and perceptions of category membership. Frontiers in Psychology, 2021.
- mPLUG: Effective and efficient vision-language learning by cross-modal skip-connections. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022a.
- Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation. In ICML, 2022b.
- BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models. In Proceedings of the 40th International Conference on Machine Learning, 2023.
- Microsoft COCO: common objects in context. In ECCV, 2014.
- Improved baselines with visual instruction tuning. In NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023a.
- Visual instruction tuning. In NeurIPS, 2023b.
- The chicago face database: A free stimulus set of faces and norming data. 2015.
- Chicago face database: Multiracial expansion. Behavior Research Methods, 2020.
- Social biases through the text-to-image generation lens. In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 2023.
- Learning from failure: De-biasing classifier from biased classifier. NeurIPS, 2020.
- Biases in large language models: Origins, inventory, and discussion. ACM Journal of Data and Information Quality, 2023.
- Glide: Towards photorealistic image generation and editing with text-guided diffusion models. In International Conference on Machine Learning, 2022.
- Dinov2: Learning robust visual features without supervision. In Transactions on Machine Learning Research, 2023.
- SDXL: Improving latent diffusion models for high-resolution image synthesis. In ICLR, 2024.
- Learning transferable visual models from natural language supervision. In ICML, 2021.
- Hierarchical text-conditional image generation with clip latents. arXiv preprint, 2022.
- High-resolution image synthesis with latent diffusion models. In CVPR, 2022.
- Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation. In CVPR, 2023.
- Photorealistic text-to-image diffusion models with deep language understanding. NeurIPS, 2022.
- Intra-processing methods for debiasing neural networks. In NeurIPS, 2020.
- Conceptnet 5.5: An open multilingual graph of general knowledge. In AAAI, 2017.
- Selective annotation makes language models better few-shot learners. In ICLR, 2023.
- Unbiased image synthesis via manifold-driven sampling in diffusion models. arXiv preprint, 2023.
- Reclip: A strong zero-shot baseline for referring expression comprehension. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022.
- Vipergpt: Visual inference via python execution for reasoning. In ICCV, 2023.
- Improving the fairness of deep generative models without retraining. arXiv preprint, 2021.
- Llama: Open and efficient foundation language models. arXiv preprint, 2023.
- Fairness definitions explained. In Proceedings of the international workshop on software fairness, 2018.
- Git: A generative image-to-text transformer for vision and language. In Transactions on Machine Learning Research, 2022a.
- Ofa: Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework. In ICML, 2022b.
- Towards fairness in visual recognition: Effective strategies for bias mitigation. In CVPR, 2020.
- Chain-of-thought prompting elicits reasoning in large language models. In NeurIPS, 2022.
- Allen R Wilcox. Indices of qualitative variation. Technical report, Oak Ridge National Lab., Tenn., 1967.
- Fairgan: Fairness-aware generative adversarial networks. In 2018 IEEE International Conference on Big Data (Big Data). IEEE, 2018.
- From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics, 2014.
- Iti-gen: Inclusive text-to-image generation. In ICCV, 2023a.
- Adding conditional control to text-to-image diffusion models. In ICCV, 2023b.
- Men also like shopping: Reducing gender bias amplification using corpus-level constraints. In EMNLP, 2017.
- Gender bias in coreference resolution: Evaluation and debiasing methods. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018.
- Chatgpt asks, blip-2 answers: Automatic questioning towards enriched visual descriptions. In Transactions on Machine Learning Research, 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.