SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification

Published 24 Jan 2024 in cs.GT | (2401.13371v3)

Abstract: Addressing the limitations of individual attribution scores via the Shapley value (SV), the field of explainable AI (XAI) has recently explored intricate interactions of features or data points. In particular, extensions of the SV, such as the Shapley Interaction Index (SII), have been proposed as a measure to still benefit from the axiomatic basis of the SV. However, similar to the SV, their exact computation remains computationally prohibitive. Hence, we propose with SVARM-IQ a sampling-based approach to efficiently approximate Shapley-based interaction indices of any order. SVARM-IQ can be applied to a broad class of interaction indices, including the SII, by leveraging a novel stratified representation. We provide non-asymptotic theoretical guarantees on its approximation quality and empirically demonstrate that SVARM-IQ achieves state-of-the-art estimation results in practical XAI scenarios on different model classes and application domains.

Abstract PDF HTML Upgrade to Chat

References (49)

Citations (3)

View on Semantic Scholar

Summary

The paper presents a novel stratified sampling method to accurately approximate any-order Shapley interactions in cooperative games.
It leverages discrete derivative stratification to reduce computational costs and delivers unbiased estimates with lower mean squared error.
Empirical evaluations on language models, ViTs, and CNNs demonstrate its superior speed and accuracy over permutation-based and kernel methods.

SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification

Introduction

The paper introduces SVARM-IQ, a novel approach for the efficient approximation of Shapley-based interaction indices, leveraging a stratified sampling method. This approach addresses the computational challenges inherent in calculating Shapley interaction indices due to their combinatorial complexity. The method is particularly applicable to cooperative games within the context of Explainable AI (XAI), where understanding the contribution of feature interactions in machine learning models is crucial.

Shapley-Based Interaction Indices

Shapley values are foundational in cooperative game theory and are extended to capture interactions between features through Shapley Interaction Indices (SIIs). These indices are essential in XAI for interpreting complex models where feature interactions play a significant role, as opposed to isolated feature attributions, which may miss crucial insights in real-world applications with high feature correlations.

Methodology: SVARM-IQ

SVARM-IQ innovates by using a stratified representation to estimate any-order Shapley interactions. The method divides the estimation process into manageable strata, each representing subsets of the data where the interaction effects are calculated separately. This stratification optimizes the approximation accuracy and computational efficiency.

Stratified Representation

The core innovation lies in stratifying the discrete derivatives that define the interaction indices, treating each stratum according to coalition size, and leveraging this structured sampling to produce unbiased estimates of interaction effects.

Algorithmic Implementation

SVARM-IQ operates by:

Sampling-based Strategy: Sampling coalitions to determine the contribution of interactions.
Strata-based Updates: Updating interaction estimates based on sampled coalitions that fit into defined strata, enhancing the reuse of samples across interaction calculations.
Multi-order Estimation: Enabling simultaneous estimation of multiple interaction orders, facilitating comprehensive insight into model behavior.

Theoretical Analysis

The paper provides theoretical guarantees on the unbiasedness and variance bounds of the SVARM-IQ estimates. It demonstrates that the method significantly reduces the expected mean squared error (MSE) compared to existing approaches by optimizing the sampling strategy within the stratified framework.

Empirical Evaluation

SVARM-IQ outperforms state-of-the-art techniques, such as permutation sampling and kernel-based approaches, in terms of both speed and accuracy, as demonstrated through various XAI tasks across LLMs (LM), vision transformers (ViT), and convolutional neural networks (CNN).

Figure 1: By dividing an ImageNet picture into multiple patches, attribution scores for single patches and interactions scores for pairs aid explaining a vision transformer.

Comparison with Baselines

The empirical results highlight that SVARM-IQ rapidly achieves precise approximation with fewer model evaluations. This efficiency is particularly beneficial in scenarios where computational resources are limited, and quick, reliable explanations are needed.

Figure 2: Schematic overview of SVARM-IQ.

Conclusion

SVARM-IQ significantly advances the field of Shapley-based interaction approximation by providing a scalable, efficient, and unbiased method applicable to a wide range of XAI scenarios. Its ability to generalize across various types of Shapley interactions and cooperative games makes it a versatile tool for practitioners seeking to interpret complex machine learning models.

Overall, SVARM-IQ represents a meaningful step forward in making complex model interactions interpretable, enabling practitioners to derive deeper insights from machine learning models while maintaining computational efficiency. Future research could explore further optimization of stratification parameters and extending the method to other forms of interaction indices beyond those currently captured.