- The paper introduces ARL, a technique that improves fairness by adversarially reweighting training examples, without relying on protected demographic features.
- It utilizes a minimax game between a learner and an adversary to focus on regions where performance gaps indicate potential bias.
- Empirical studies on datasets like UCI Adult and COMPAS show ARL significantly improves worst-case AUC for underrepresented groups without compromising overall model utility.
Fairness without Demographics through Adversarially Reweighted Learning
The paper presents a novel framework called Adversarially Reweighted Learning (ARL) for addressing fairness in ML models without requiring access to protected demographic features (e.g., race, gender) during training or inference. Most existing fairness approaches assume these features are accessible, which is often not the case due to privacy regulations and ethical considerations. This research aims to bridge that gap by leveraging correlations between the non-protected features, the task labels, and membership in unobserved protected groups.
Problem Context and Motivation
In high-stakes applications such as healthcare and finance, models tend to exhibit biases due to disparities in training data. Traditional fairness approaches rely heavily on protected features to mitigate these biases. In practice, however, demographic information is often unavailable or cannot legally be used, which poses a significant challenge for implementing fairness. The work is motivated by policy frameworks like GDPR, which impose strict restrictions on the use of demographic data while still enforcing fairness requirements. Consequently, the paper addresses the question: how can ML models be trained to improve fairness in the absence of protected group data?
Proposed Method: Adversarially Reweighted Learning
ARL employs adversarial reweighting to address fairness without demographics. The core idea is to improve model performance in the regions of the feature and label space where errors signal potential fairness issues. This is framed as a minimax game between two players: a learner and an adversary. The adversary is optimized to assign higher weights to computationally identifiable regions of high error, thereby indirectly improving worst-case performance across unobserved protected groups. Unlike standard distributionally robust optimization (DRO) approaches, ARL's restriction to computationally identifiable errors mitigates the risk of overfitting to noisy individual outliers.
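The minimax reweighting can be sketched on a toy problem: a logistic-regression "learner" minimizes a weighted cross-entropy while a linear "adversary" maximizes it by shifting example weights toward an identifiable high-error region. This is a minimal illustration of the mechanics under simplifying assumptions (the paper uses neural networks for both players), and the synthetic data, hyperparameters, and normalization scheme here are illustrative choices, not the paper's exact setup.

```python
import math
import random

random.seed(0)

def sigmoid(z):
    z = max(min(z, 30.0), -30.0)  # clip for numerical stability
    return 1.0 / (1.0 + math.exp(-z))

def xent(p, y):
    eps = 1e-7
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

# Toy data: a majority region (x0 = 0) with a linearly learnable rule,
# and a small identifiable region (x0 = 3) whose labels a linear
# learner cannot fit, so its loss stays high.
data = []
for _ in range(80):
    x1 = random.uniform(-1.0, 1.0)
    data.append(((0.0, x1), 1 if x1 > 0 else 0))
for _ in range(20):
    x1 = random.uniform(-1.0, 1.0)
    data.append(((3.0, x1), random.randint(0, 1)))
n = len(data)

def predict(w, x):  # learner: logistic regression
    return sigmoid(w[0] * x[0] + w[1] * x[1] + w[2])

def weights(a):     # adversary: lambda_i = 1 + n * s_i / sum_j s_j
    s = [sigmoid(a[0] * x[0] + a[1] * x[1] + a[2]) for x, _ in data]
    tot = sum(s)
    return [1.0 + n * si / tot for si in s]

def weighted_loss(w, a):
    lam = weights(a)
    return sum(l * xent(predict(w, x), y) for l, (x, y) in zip(lam, data)) / n

w = [0.0, 0.0, 0.0]  # learner parameters (minimize weighted loss)
a = [0.0, 0.0, 0.0]  # adversary parameters (maximize weighted loss)
lr_w, lr_a, fd_eps = 0.5, 0.5, 1e-4

for _ in range(300):
    # Learner step: closed-form gradient of weighted cross-entropy.
    lam = weights(a)
    g = [0.0, 0.0, 0.0]
    for l, (x, y) in zip(lam, data):
        err = l * (predict(w, x) - y) / n
        g[0] += err * x[0]
        g[1] += err * x[1]
        g[2] += err
    w = [wi - lr_w * gi for wi, gi in zip(w, g)]

    # Adversary step: gradient *ascent* via finite differences (3 params).
    base = weighted_loss(w, a)
    ga = []
    for j in range(3):
        a2 = list(a)
        a2[j] += fd_eps
        ga.append((weighted_loss(w, a2) - base) / fd_eps)
    a = [ai + lr_a * gi for ai, gi in zip(a, ga)]

lam = weights(a)
maj = sum(l for l, (x, _) in zip(lam, data) if x[0] == 0.0) / 80
mino = sum(l for l, (x, _) in zip(lam, data) if x[0] == 3.0) / 20
print(f"avg weight, easy region: {maj:.2f}  hard region: {mino:.2f}")
```

Because the weights are normalized so that their mean stays at 2, upweighting the persistently high-loss region necessarily downweights the easy one; the adversary's linear form is what limits it to *computationally identifiable* regions rather than individual noisy points.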
Empirical Results
ARL demonstrates superior performance on various fairness metrics across several datasets, such as UCI Adult, LSAC, and COMPAS, where it consistently improves Rawlsian Max-Min fairness. Notably, ARL raises the AUC of the worst-case protected group relative to other state-of-the-art methods, including traditional DRO techniques. It is also robust to disparities in group representation and less susceptible to performance degradation from noisy labels, a common pitfall of fairness optimization algorithms.
Quantitatively, ARL improves AUC for underrepresented groups while maintaining or enhancing the overall AUC on each dataset, indicating that fairness gains need not come at the cost of model utility. This subtly challenges the perceived trade-off between fairness and performance, positioning ARL as a framework that achieves a better balance between the two.
Theoretical and Practical Implications
Theoretically, ARL underscores the importance of computationally identifiable signals within datasets as a proxy for demographic information. It enriches the understanding of fairness without explicit demographic data by demonstrating the feasibility of addressing biases through correlations inherent in non-protected features and labels. Practically, ARL paves the way for more inclusive ML systems that adhere to privacy and legal constraints while ensuring equitable treatment across all user demographics. This framework is poised to have significant implications for real-world applications wherein demographic features are either unavailable or ethically contentious, offering a versatile solution under such constraints.
Future Directions
The research opens several avenues, particularly in extending the adversarial frameworks to more complex scenarios with varied data distributions and more granular subgroups. There is also potential in integrating ARL with other debiasing techniques for improved robustness and generalization. Furthermore, exploring dynamic learning rates within the adversarial setup and optimizing computational resources for large-scale applications warrant further investigation.
In conclusion, this paper provides a rigorous approach to ensuring fairness in machine learning without demographics, expanding both the theoretical landscape and applicability of fairness in practice. ARL stands as a promising direction for advancing ethical AI in alignment with modern privacy regulations.