Papers
Topics
Authors
Recent
Search
2000 character limit reached

Local and Global Explanation Methods

Updated 26 January 2026
  • Local and global explanation methods are formal approaches that decompose model predictions into individual feature contributions and interaction effects for enhanced interpretability.
  • They link partial dependence plots and interventional SHAP values through an ANOVA-style functional decomposition, clarifying both local and global model behavior.
  • They support fairness interventions by enabling the precise removal of protected feature influences, thereby isolating and mitigating bias in model predictions.

Local and global explanation methods are formal approaches used in interpretable machine learning to elucidate how complex models make decisions. Local explanation methods characterize the prediction for an individual instance, typically by attributing contributions to features relevant for that point. Global explanation methods aim to summarize how a model behaves on average, across the entire input domain, by decomposing its prediction function or aggregating local mechanisms. The distinction, and the methodologies that bridge it, are central to modern explainable AI, spanning regression, classification, time-series, text, and vision domains.

1. Formal Decomposition and Marginal Identification

Functional decomposition is a rigorous approach to unify local and global explanation methods by expressing any prediction function f:Rd→Rf : \mathbb{R}^d \to \mathbb{R} as a sum over all feature subsets:

f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)

Here, f∅=f0f_\emptyset=f_0 (intercept), f{k}(xk)f_{\{k\}}(x_k) encodes the main effect of feature kk, and fS(xS)f_S(x_S) for ∣S∣≥2|S|\ge2 encodes pure interactions. This ANOVA-style expansion is not unique unless an additional identification constraint is imposed.

The marginal-identification constraint requires that for all SS,

∑T:T∩S≠∅∫fT(xT)pS(xS)dxS=0\sum_{T: T\cap S \neq \emptyset} \int f_T(x_T) p_S(x_S) dx_S = 0

where pSp_S is the marginal density of f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)0. This uniquely pins down each f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)1 via Möbius inversion:

f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)2

This identification recovers core local and global quantities. Specifically, for any singleton f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)3, the partial dependence plot (PDP) coincides with f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)4, while interventional SHAP values use weighted sums over main effect and interaction components (Hiabu et al., 2022).

2. Relationships with Core Attribution Methods

2.1 Partial Dependence and Feature Effects

The partial dependence plot for f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)5 is:

f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)6

Under marginal identification, this simplifies to f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)7. For any feature subset f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)8,

f(x)=∑S⊆{1,…,d}fS(xS)f(x) = \sum_{S \subseteq \{1,\ldots,d\}} f_S(x_S)9

2.2 Interventional SHAP Values

The interventional SHAP value for f∅=f0f_\emptyset=f_00:

f∅=f0f_\emptyset=f_01

Each interaction term f∅=f0f_\emptyset=f_02 that involves f∅=f0f_\emptyset=f_03 is apportioned f∅=f0f_\emptyset=f_04 of its value to feature f∅=f0f_\emptyset=f_05 (Hiabu et al., 2022). This recasting connects Shapley values, previously motivated through game theory, to a functional expansion, exposing the direct tie between local and global explanations.

2.3 Feature Importance Measures

Standard SHAP importance f∅=f0f_\emptyset=f_06 mixes main and interaction effects. The decomposition enables refined importance measures:

f∅=f0f_\emptyset=f_07

For order separation,

f∅=f0f_\emptyset=f_08

3. Algorithms for Exact and Approximate Functional Decomposition

The number of components f∅=f0f_\emptyset=f_09 is generally exponential in f{k}(xk)f_{\{k\}}(x_k)0, but many machine learning models admit low-dimensional base learners (e.g., tree ensembles with interaction order f{k}(xk)f_{\{k\}}(x_k)1). For such models, exact functional decomposition is feasible:

  • For each tree, identify the split set f{k}(xk)f_{\{k\}}(x_k)2.
  • For each f{k}(xk)f_{\{k\}}(x_k)3, recursively compute f{k}(xk)f_{\{k\}}(x_k)4—the marginal tree value as an expectation over unselected features.
  • For each f{k}(xk)f_{\{k\}}(x_k)5, assemble f{k}(xk)f_{\{k\}}(x_k)6 from f{k}(xk)f_{\{k\}}(x_k)7 via signed sums.
  • For XGBoost, Algorithm 3 describes efficient aggregation via f{k}(xk)f_{\{k\}}(x_k)8 tables; for random planted forests, grid-based marginals suffice.

Complexity is f{k}(xk)f_{\{k\}}(x_k)9 for kk0 trees of interaction-order kk1 and kk2 points (Hiabu et al., 2022).

4. Post-hoc De-biasing and Fairness Interventions

Bias removal targeting protected features relies on the additive decomposition. Consider kk3 as a block of features (e.g., gender) to eliminate from the model:

kk4

Thus, forming a debiased predictor entails simply dropping every component kk5 with kk6 from the sum. This approach removes both direct and indirect bias via protected features (Hiabu et al., 2022).

5. Key Experimental Results: Fidelity and Separation of Effects

Empirical studies verify the nuances of functional decomposition:

  • Toy problems: For kk7 with correlated features, standard interventional SHAP may mask main effects due to cancellation. Decomposition yields distinct main/interaction components kk8, kk9, fS(xS)f_S(x_S)0, which can be inspected separately.
  • Real data: On bike-sharing, decomposing fitted XGBoost reveals strong interaction terms (e.g., hourfS(xS)f_S(x_S)1workingday) that standard SHAP summary plots cannot separate.
  • Feature importance simulation: When interactions involve specific features (e.g., fS(xS)f_S(x_S)2), the decomposed importances precisely attribute effects by order.
  • De-biasing: When retraining on a subset of unprotected features, standard refitting fails to remove indirect bias. Decomposition allows rigorous elimination, as shown on simulated salary data and UCI Adult.

6. Integration with Broader Explanation Frameworks

Functional decomposition under marginal identification creates a mathematical bridge between local and global explanations, encompassing:

7. Practical Implications and Trade-offs

This unification addresses critical needs in explainable AI:

  • Separates out main/interaction effects so practitioners can diagnose local anomalies, interaction-induced bias, or global functional structure.
  • Enables robust feature importance assessments not confounded by interaction-induced cancellation.
  • Provides actionable pipelines for post-hoc fairness interventions by component removal, rather than ambiguous retraining.
  • Offers scalable algorithms amenable to tree ensembles and structured models.

The key limitation remains the exponential scaling of decomposition with dimension fS(xS)f_S(x_S)3 for general black-box models absent low-order structure. Approximate methods, surrogates, and rule-based reductions remain essential for scalability (Seppäläinen et al., 14 Feb 2025, Setzu et al., 2021).


In summary, local and global explanation methods are formalized and unified by functional decomposition with marginal identification constraints. This framework rigorously subsumes interventional SHAP, partial dependence, feature effect curves, and enables principled algorithms for bias reduction and feature importance. Empirical findings demonstrate increased fidelity, interpretability, and robustness, particularly for models and domains with rich interaction structures (Hiabu et al., 2022).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Local and Global Explanation Methods.