GeoShapley: Explainable AI for Spatial Modeling

Updated 21 January 2026

GeoShapley is a model-agnostic explainable AI approach that adapts Shapley values to spatial data by quantifying nonlinear effects and geographic heterogeneity.
It decomposes model predictions into intrinsic location effects, global feature contributions, and feature–location interactions to yield clear additive attributions.
GeoShapley leverages an adapted Kernel SHAP algorithm for efficient coalition sampling, offering interpretable outputs that support geospatial analysis and policy design.

GeoShapley is a model-agnostic explainable AI (XAI) approach for spatial machine learning models that extends game-theoretic Shapley values to quantify both nonlinear feature effects and spatial heterogeneity in predictive modeling. It treats geographic location as a single joint “player” in the Shapley value decomposition, which separates intrinsic spatial context, global feature contributions, and synergistic feature–location interactions. It is applied for post-hoc interpretation of black-box models such as XGBoost, Random Forest, and TabNet that operate on georeferenced tabular data and spatial feature embeddings. GeoShapley’s outputs are directly interpretable as additive decompositions of model predictions: they map intrinsic location effects and spatially varying coefficients and provide tract-level attribution for geospatial analysis, regression, and policy design (Li, 2023; Lu et al., 17 Dec 2025; Li, 1 May 2025; Li et al., 16 Apr 2025; Liu, 2024).

1. Theoretical Foundation and Mathematical Formulation

GeoShapley generalizes the classic Shapley value framework from cooperative game theory to the spatial domain. For a model $f(x)$ with $p$ features in total, of which $g$ are geographic coordinates (typically $g=2$ for latitude and longitude), GeoShapley decomposes the prediction for each instance $i$ as

$\hat{y}_i = \Phi_0 + \Phi_{\text{GEO},i} + \sum_{j=1}^{p-g} \Phi_{j,i} + \sum_{j=1}^{p-g} \Phi_{(\text{GEO}, j),i}$

where:

$\Phi_0$ is the global baseline (expected prediction).
$\Phi_{\text{GEO},i}$ is the intrinsic location effect, representing pure spatial context independent of other features.
$\Phi_{j,i}$ is the global, location-invariant effect of feature $j$, capturing nonlinear global relationships.
$\Phi_{(\text{GEO},j),i}$ is the synergistic interaction between location (GEO) and feature $j$, quantifying spatial heterogeneity of feature effects (Li, 2023; Lu et al., 17 Dec 2025).

Mathematical expressions for each component employ combinatorial averaging over all feature orderings, adapting the denominators to account for the joint treatment of spatial features. For example, the location effect is given by

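$\Phi_{\text{GEO},i} = \sum_{S \subseteq M \setminus \{\text{GEO}\}} \frac{s!\,(p-g-s)!}{(p-g+1)!} \left[ f(S \cup \{\text{GEO}\}) - f(S) \right]$

where $M$ is the set of $p-g+1$ players (the $p-g$ non-spatial features plus the joint GEO player), $S$ is a coalition of size $s$, and $f(S)$ is the model output for instance $i$ when only the players in $S$ are present.

Interaction effects are computed using differences formed by including and excluding both location and a feature from the coalition:

$\Phi_{(\text{GEO},j),i} = \sum_{S \subseteq M \setminus \{\text{GEO}, j\}} \frac{s!\,(p-g-s-1)!}{(p-g)!} \, \Delta_{\{\text{GEO},j\}}(S)$

with

$\Delta_{\{\text{GEO},j\}}(S) = f(S \cup \{\text{GEO}, j\}) - f(S \cup \{\text{GEO}\}) - f(S \cup \{j\}) + f(S)$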

This additive decomposition satisfies the efficiency property of cooperative game theory (the components sum to the model prediction) and provides unbiased local estimates for all spatial and non-spatial effect components (Li, 2023; Li, 1 May 2025).
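The joint-player idea can be illustrated by exact enumeration on a small coalition game. The following is a minimal sketch, not the geoshapley implementation: the value function, the player names, and the interaction normalization (no factor of one half) are assumptions chosen for illustration. It computes the Shapley value of a single GEO player, a pairwise GEO–feature interaction, and checks that the player-level Shapley values sum to the value of the full coalition.

```python
from itertools import combinations
from math import factorial

# Hypothetical toy game: "GEO" stands for the whole coordinate block (one joint player).
PLAYERS = ["GEO", "x1", "x2"]

def value(coalition):
    """Illustrative coalition value with a GEO-x1 synergy of 1.5."""
    s = set(coalition)
    v = 0.0
    if "GEO" in s:
        v += 2.0
    if "x1" in s:
        v += 3.0
    if "x2" in s:
        v += 0.5
    if "GEO" in s and "x1" in s:
        v += 1.5
    return v

def subsets(items):
    """Yield every subset of items as a tuple."""
    for r in range(len(items) + 1):
        yield from combinations(items, r)

def shapley(player, players, v):
    """Exact Shapley value of one player; weight is s!(n-s-1)!/n!."""
    others = [q for q in players if q != player]
    n = len(players)
    total = 0.0
    for S in subsets(others):
        w = factorial(len(S)) * factorial(n - len(S) - 1) / factorial(n)
        total += w * (v(S + (player,)) - v(S))
    return total

def interaction(a, b, players, v):
    """Pairwise interaction; weight is s!(n-s-2)!/(n-1)! (assumed normalization)."""
    others = [q for q in players if q not in (a, b)]
    n = len(players)
    total = 0.0
    for S in subsets(others):
        w = factorial(len(S)) * factorial(n - len(S) - 2) / factorial(n - 1)
        delta = v(S + (a, b)) - v(S + (a,)) - v(S + (b,)) + v(S)
        total += w * delta
    return total

print(round(shapley("GEO", PLAYERS, value), 6))            # 2.75
print(round(interaction("GEO", "x1", PLAYERS, value), 6))  # 1.5
```

With $n = p-g+1$ players, the two weights reduce to $s!(p-g-s)!/(p-g+1)!$ and $s!(p-g-s-1)!/(p-g)!$. In this toy game the GEO Shapley value (2.75) contains half of the 1.5 synergy with x1, which is why a separate interaction term is needed to isolate an intrinsic location effect.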

2. Computation: Kernel-SHAP Algorithm and Practical Implementation

GeoShapley relies on an adapted Kernel SHAP estimator for efficient approximation of Shapley values in high-dimensional models. Key steps include:

Model Fitting: Train a black-box ML model (e.g., XGBoost) on all features including spatial coordinates.
Coalition Sampling: For each instance, sample a set of random feature orderings, treating the GEO block as a joint player.
Marginalization: For each coalition, marginalize absent features by imputing background reference values (bootstrapped or clustered).
Computation: Evaluate model outputs with present features, accumulating marginal contributions for each coalition using the appropriate Shapley weights.
Decomposition: Average across sampled coalitions to estimate $\Phi_0$, $\Phi_{\text{GEO},i}$, $\Phi_{j,i}$, and $\Phi_{(\text{GEO},j),i}$ per instance.
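The sampling and marginalization steps can be sketched in plain Python. This is an illustrative simplification under stated assumptions: it uses permutation sampling rather than the weighted regression of Kernel SHAP, it returns only player-level attributions (GEO and each feature) without the separate interaction terms, and `model`, `PLAYERS`, and the data are hypothetical stand-ins.

```python
import random

def model(lon, lat, x1, x2):
    """Hypothetical stand-in for a fitted black-box regressor."""
    return 0.5 * lon + lat + x1 ** 2 + lon * x2

# Each player maps to the column indices it controls; GEO owns both coordinates.
PLAYERS = {"GEO": (0, 1), "x1": (2,), "x2": (3,)}

def coalition_value(present, instance, background):
    """Mean prediction with absent players' columns imputed from background rows."""
    keep = {i for name in present for i in PLAYERS[name]}
    total = 0.0
    for row in background:
        mixed = [instance[i] if i in keep else row[i] for i in range(len(row))]
        total += model(*mixed)
    return total / len(background)

def sample_attributions(instance, background, n_orderings=200, seed=0):
    """Average marginal contributions over random player orderings."""
    rng = random.Random(seed)
    names = list(PLAYERS)
    phi = {name: 0.0 for name in names}
    for _ in range(n_orderings):
        rng.shuffle(names)
        present = []
        prev = coalition_value(present, instance, background)
        for name in names:
            present.append(name)  # the GEO block enters as a single unit
            cur = coalition_value(present, instance, background)
            phi[name] += cur - prev
            prev = cur
    return {name: total / n_orderings for name, total in phi.items()}

background = [(0.0, 0.0, 1.0, 2.0), (1.0, 1.0, 0.0, 1.0), (2.0, 0.5, -1.0, 0.0)]
instance = (1.0, 2.0, 3.0, 4.0)
phi = sample_attributions(instance, background)
baseline = coalition_value([], instance, background)
print(phi, baseline)
```

Because every ordering starts from the empty coalition and ends at the full one, the attributions sum to the prediction minus the baseline regardless of how many orderings are sampled; sampling noise only affects how that total is split among players.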

The geoshapley Python package implements these steps, supporting both exact enumeration (small $p$) and Monte Carlo kernel sampling (large $p$), with background set sizes typically in the 50–300 range for computational efficiency. For spatial coefficient surface estimation, outputs can be further smoothed by geographically weighted regression (GWR) (Li, 2023; Li et al., 16 Apr 2025; Li, 1 May 2025; Liu, 2024).

Algorithm Step	Key Operation	Computational Complexity
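Coalition sampling	Random orderings	Proportional to the number of sampled orderings
Marginalization	Background imputation	(Sampled coalitions × background size) model evaluations
Additive decomposition	Linear weighting	Linear in the number of sampled coalitions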

Practical recommendations include bootstrapping for uncertainty quantification and careful validation of base model fidelity before GeoShapley analysis (Li, 1 May 2025; Li, 2023).
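As one way to picture the bootstrapping recommendation, the sketch below computes a percentile bootstrap interval for a summary statistic of per-instance attributions. It is an assumption-laden simplification: the attribution values are made up, `bootstrap_ci` is a hypothetical helper, and a fuller analysis would refit the model and recompute attributions on each resample instead of resampling a fixed set of values.

```python
import random
import statistics

def bootstrap_ci(values, stat=statistics.mean, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for stat(values)."""
    rng = random.Random(seed)
    n = len(values)
    stats = sorted(
        stat([values[rng.randrange(n)] for _ in range(n)])  # resample with replacement
        for _ in range(n_boot)
    )
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper

# Made-up absolute location attributions for a handful of instances.
abs_phi_geo = [0.8, 1.1, 0.9, 1.3, 0.7, 1.0, 1.2, 0.95, 1.05, 0.85, 1.15, 0.9]
lower, upper = bootstrap_ci(abs_phi_geo)
print(lower, statistics.mean(abs_phi_geo), upper)
```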

3. Interpretative Structure: Spatially Varying Effects and Additive Attribution

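GeoShapley directly parallels the interpretive structure of spatially varying coefficient models (SVCMs) and additive models. In GWR, the regression is written as $y_i = \beta_0(u_i, v_i) + \sum_j \beta_j(u_i, v_i)\, x_{ij} + \varepsilon_i$, where $(u_i, v_i)$ are the coordinates of observation $i$; GeoShapley provides:

$\Phi_{\text{GEO},i}$ as the analog of $\beta_0(u_i, v_i)$ (spatial fixed effect).
$\Phi_{j,i}$ as the global main effect (analogous to $f_j(x_{ij})$ in an additive model).
$\Phi_{(\text{GEO},j),i}$ as the spatially varying part of each feature's effect, from which a coefficient surface is recoverable via univariate smoothing (Li, 2023; Li et al., 16 Apr 2025).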
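One way to read "recoverable via univariate smoothing" is as a kernel-weighted local regression of a feature's interaction values on the feature itself. The sketch below is a hedged illustration of that reading, not the package's procedure: `local_slope`, the Gaussian kernel, the bandwidth, and the synthetic data (interaction values built as $\beta(u)\,x$ with $\beta(u) = 1 + 2u$) are all assumptions.

```python
import math

def local_slope(u0, v0, coords, x, phi, bandwidth):
    """Gaussian-kernel weighted least-squares slope of phi on x around (u0, v0)."""
    w = [math.exp(-((u - u0) ** 2 + (v - v0) ** 2) / (2 * bandwidth ** 2))
         for u, v in coords]
    sw = sum(w)
    x_bar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    phi_bar = sum(wi * ph for wi, ph in zip(w, phi)) / sw
    sxx = sum(wi * (xi - x_bar) ** 2 for wi, xi in zip(w, x))
    sxp = sum(wi * (xi - x_bar) * (ph - phi_bar) for wi, xi, ph in zip(w, x, phi))
    return sxp / sxx

# Synthetic grid: the feature varies along v only; the true coefficient varies along u.
coords = [(a / 10, b / 10) for a in range(11) for b in range(11)]
x = [float(b % 3 - 1) for a in range(11) for b in range(11)]
phi_geo_x = [(1 + 2 * u) * xi for (u, v), xi in zip(coords, x)]

slope_center = local_slope(0.5, 0.5, coords, x, phi_geo_x, bandwidth=0.2)
print(slope_center)  # close to beta(0.5) = 2 on this symmetric grid
```

Evaluating `local_slope` at every observation location yields a coefficient surface that can be mapped; the bandwidth controls how smooth that surface is, so it should be chosen and reported deliberately.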

Unlike standard SHAP, which isolates only global main effects (and conflates spatial and non-spatial contributions), GeoShapley explicitly separates intrinsic geographic context from feature–location interactions, yielding interpretable tract- or pixel-level maps essential for policy and scientific inference (Lu et al., 17 Dec 2025).

4. Comparative Evaluation: SHAP, MGWR, Moran Eigenvectors, and Ensemble Extensions

GeoShapley has been comparatively evaluated against:

Standard SHAP: Captures only global feature effects ($\Phi_{j,i}$), failing to separate spatial heterogeneity or identify location–feature synergies.
MGWR: Produces parametric, smoothly varying coefficient surfaces using kernel bandwidth selection but is limited in nonlinear functional recovery and requires explicit kernel choice (Lu et al., 17 Dec 2025; Li, 1 May 2025; Liu, 2024).
Moran Eigenvector Spatial Filtering (ESF): ESF embeds spatial eigenvectors as features for controlling spatial autocorrelation but lacks direct marginal attribution of local spatial context; GeoShapley can treat high-dimensional spatial embeddings as the geo-player, with computational caveats when eigenvector blocks are large (Li et al., 16 Apr 2025).
XGeoML: Recent ensemble frameworks such as XGeoML generalize GeoShapley by aggregating multiple local models (GBR, RF, MLP, etc.) and multiple explainers (SHAP, LIME, FI) using spatial weighting and reliability-based ensemble aggregation. XGeoML enhances predictive and interpretive accuracy, particularly under strong geography–covariate nonlinearity (Liu, 2024).

Method	Nonlinearity	Spatial Heterogeneity	Model Assumptions
SHAP	Yes	No	None
MGWR	Limited	Yes	Linear/Kernels
ESF	No	Indirect	Linear
GeoShapley	Yes	Yes	None
XGeoML	Yes	Yes (robust/ensemble)	None

Among these methods, GeoShapley and its ensemble extension XGeoML are the only ones that jointly explain nonlinearity and spatial heterogeneity while remaining model-agnostic (Lu et al., 17 Dec 2025; Li, 2023).

5. Case Studies and Empirical Applications

GeoShapley has been utilized in various domains:

Synthetic Data Validation: GeoShapley recovers ground-truth spatial intercepts and varying coefficients for simulated spatial fields, accurately reflecting both nonlinear and spatial processes (validated by spatial correlations exceeding 0.9 for coordinate-based models) (Li et al., 16 Apr 2025; Li, 2023).
Traffic Crash Density (Florida): GeoShapley reveals nonlinear effects such as threshold-driven crash risk in compact neighborhoods (score > 7), spatially amplifies urban crash contributions (Miami, Orlando, Tampa, Jacksonville), and identifies corridor-specific susceptibility (I-95, downtown cores). Interaction terms elucidate spatial variability in risk factors, facilitating targeted interventions (traffic calming, speed management, equity-based policy) (Lu et al., 17 Dec 2025).
Political Science (Voting Behaviors): County-level applications isolate both S-shaped nonlinearities in demographic predictors and spatially structured party preference patterns (e.g., South vs. Northeast) (Li, 1 May 2025).
Real Estate Modeling: In Seattle house price modeling, location exceeds all other features in explanatory value, with spatial interactions (e.g., house age premium in historic neighborhoods) directly mapped via GeoShapley outputs (Li, 2023).

Visualization strategies include mapping $\Phi_{\text{GEO},i}$ (intrinsic context), partial dependence plots of $\Phi_{j,i}$ (global nonlinearities), and spatial coefficient surfaces via $\Phi_{(\text{GEO},j),i}$ (Lu et al., 17 Dec 2025; Li, 2023).

6. Limitations, Implementation Considerations, and Future Directions

GeoShapley has several limitations and considerations:

Computational Intensity: Kernel SHAP estimates require hundreds of background samples and numerous coalition evaluations per location; computational cost scales with feature and geo-player block size (Li, 2023; Li et al., 16 Apr 2025).
Variance and Stability: Monte Carlo approximation and choice of background samples influence variance; large GEO blocks (e.g., Moran eigenvectors) amplify estimation noise unless coalition sampling is increased (Li et al., 16 Apr 2025).
Interpretation: Shapley values are marginal contributions, not regression slopes. Naively mapping $\Phi_{(\text{GEO},j),i}$ can mislead without smoothing and robustness checks (Li, 1 May 2025).
Model Fidelity: Explanatory insight is conditional on the accuracy and fit of the underlying ML model; robust preprocessing and diagnostics are essential (Li, 1 May 2025).
Kernel Bandwidth: For spatial smoothing, bandwidth selection critically affects smoothness and pattern recovery (Liu, 2024).
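To make the computational-intensity point concrete, the following back-of-the-envelope sketch counts model evaluations under two assumptions that are not stated in the sources: exact enumeration visits every subset of players, and each coalition is evaluated once per background row. The helper names and the example sizes are hypothetical.

```python
def exact_coalitions(n_features, n_coords):
    """Coalitions under exact enumeration when the coordinate block is one player."""
    n_players = n_features - n_coords + 1  # non-spatial features plus GEO
    return 2 ** n_players

def model_evaluations(n_coalitions, background_size, n_instances=1):
    """Total model calls: one per coalition, background row, and explained instance."""
    return n_coalitions * background_size * n_instances

coalitions = exact_coalitions(n_features=12, n_coords=2)
per_instance = model_evaluations(coalitions, background_size=100)
print(coalitions, per_instance)  # 2048 204800
```

Treating the two coordinates as one player halves the coalition count relative to treating them separately (2^11 versus 2^12 in this example), but the count still doubles with every added feature, which is why sampling replaces enumeration for larger feature sets.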

Proposed advances include causal Shapley extensions (direct/indirect effects), improved visualization and inference for high-dimensional spatial partial dependence, integration with bias detection and spatial fairness, and development of transferable spatial XAI models (Li, 1 May 2025; Liu, 2024).

7. Software Ecosystem and Workflow Recommendations

Implementation of GeoShapley is supported by open-source tools including the geoshapley Python package (pip install geoshapley), leveraging scikit-learn, xgboost, flaml (AutoML), mgwr (MGWR reference), pysal, geopandas, and matplotlib. Recommended workflow:

Collect and preprocess georeferenced tabular data.
Fit a flexible ML regressor (ensemble/tree/NN), validate via cross-validation and hyperparameter optimization.
Compute GeoShapley values using geoshapley, specifying model, background set, and coordinate block.
Assess uncertainty via bootstrap resampling and reporting of confidence intervals.
Visualize intrinsic location and spatially varying coefficients via map plots and partial dependence curves.
Compare outputs to MGWR or ESF for benchmarking.
Document all code, data, and parameter choices for reproducibility (Li, 2023; Li, 1 May 2025).

Usage and performance tips include exact enumeration for small feature sets, Monte Carlo sampling for larger sets, background selection via clustering, and ensemble aggregation for robust spatial coefficient estimation in XGeoML (Li, 2023; Liu, 2024).