Spherical Spatial Autoregressive Model
- Spherical spatial autoregressive models are advanced frameworks that capture spatial dependencies on spherical domains using spectral harmonics and optimal transport techniques.
- They employ isotropic kernels and LASSO regularization to estimate harmonic coefficients, achieving dimensionality reduction and improved predictive accuracy.
- These models provide rigorous inference, uncertainty quantification, and practical applications in fields such as geochemistry and demography.
Spherical spatial autoregressive models generalize classical spatial autoregression to data residing on spherical domains, such as or more generally a unit sphere in a separable Hilbert space. These models accommodate rotational invariance and the non-Euclidean geometry intrinsic to spherical data, providing a framework for rigorous spatial modeling, inference, and prediction in geochemistry, demography, and other disciplines. Three principal formulations have emerged: spectral-domain operator-based SPHAR models (Caponera et al., 2019), parametric functional SAR models (Caponera et al., 2021), and optimal transport geometric SSAR models (Xu et al., 23 Jan 2026).
1. Model Specification and Structural Foundations
The canonical order-1 spherical SAR model posits a mean-zero, stationary-in-time Gaussian random field on the sphere, isotropic in space:
where is a bounded (Hilbert–Schmidt) linear operator or autoregressive kernel, and is isotropic spherical white noise with strictly positive angular power spectrum (Caponera et al., 2019, Caponera et al., 2021). is typically represented as convolution with an isotropic kernel , or equivalently, via expansion in spherical harmonics or Legendre polynomials.
In the optimal-transport based SSAR framework, observations are points on the unit sphere in a Hilbert space , with model residuals defined by the logarithmic Fréchet difference (where is the population Fréchet mean). The spatial dependence is enforced by:
where is a symmetric, row-standardized spatial-weights matrix constructed from spherical neighborhood structure (Xu et al., 23 Jan 2026).
2. Spectral Diagonalization and Parametric Dynamics
Isotropy of the field implies commutes with rotations, and is thus diagonalizable by spherical harmonics . For any ,
with eigenvalues controlling amplification/attenuation of spatial frequencies. For strictly isotropic cases (Caponera et al., 2019, Caponera et al., 2021). Stationarity and existence of the solution require .
Parametric extensions enforce regular variation on the spectral parameters, e.g.
with noise spectrum (Caponera et al., 2021). The harmonic-domain dynamics decouple into
and estimation targets underlying power-law spectral decay.
3. Estimation Procedures and Oracle Properties
Three estimation regimes prevail. In the spectral SPHAR model, LASSO regularization is imposed on harmonics:
with , . Coordinate descent (soft-thresholding) solves the decoupled scalar AR(1) regressions in the harmonic domain (Caponera et al., 2019).
Parametric models employ nonlinear least-squares, formulating a reduced objective in by profile minimization in , with closed-form and
yielding consistent and asymptotically normal estimators (Caponera et al., 2021).
The SSAR model adopts a GMM strategy, leveraging moment conditions based on empirical covariances of Hilbert–Schmidt residuals and instruments (typically ). The estimate
is computed via robust 1D optimization, with asymptotic normality established under broad regularity (Xu et al., 23 Jan 2026).
4. Asymptotic Theory and Statistical Testing
SPHAR(1) LASSO estimators obey nonasymptotic oracle bounds of order for -sparse coefficient vectors, and analogous prediction error rates. The parametric nonlinear least-square estimator is weakly consistent and asymptotically normal:
where is an explicit function of fourth-order moments of the harmonic coefficients and model parameters (Caponera et al., 2021).
In the SSAR context, Wald-type test statistics
are asymptotically under the null hypothesis , with bootstrap refinement enhancing finite-sample calibration. The GMM estimator is unbiased with RMSE decaying at for fixed neighbor number . Laws of large numbers for spatially dependent arrays and delta-method arguments underpin the convergence proofs (Xu et al., 23 Jan 2026).
5. Uncertainty Quantification and Prediction
SSAR models support non-Euclidean, distribution-free, conformal prediction for uncertainty quantification. The split-conformal method partitions data into training/calibration, fits the SSAR, computes conformity scores (residual norms in ), and defines prediction sets on via empirical quantiles. Theoretical guarantees ensure coverage up to (Xu et al., 23 Jan 2026). Jackknife+ variants achieve improved, nearly exact finite-sample coverage. The prediction back to the sphere uses the Rodrigues/exponential map to lift operator residuals to points on , maintaining interpretability in terms of geodesic radii.
In the spectral SPHAR context, LASSO-induced sparsity results in prediction using only a subset of dominating harmonics:
yielding computational efficiency and improved accuracy when underlying spatial dynamics are low-dimensional (Caponera et al., 2019).
6. Practical Implementation and Applications
Construction of the spatial-weights matrix in SSAR requires explicit consideration of spherical geometry. Neighbors are chosen in the physical coordinate domain; kernels such as uniform or Gaussian, with bandwidth , define weights and are typically row-standardized. In high-dimensional regimes, principal component analysis is recommended for error covariance estimation, though bootstrap is preferable in very large spaces (Xu et al., 23 Jan 2026).
Real-data applications—Spanish GEMAS geochemical composition and Japanese age-at-death distributions—demonstrate pronounced spatial autoregressive effects ( and $0.41$ respectively). SSAR and SRMSAR yield lower geodesic and compositional errors compared to Euclideanized alternatives, with significantly reduced computation times. In some scenarios, inclusion of exogenous covariates via SRMSAR improves out-of-sample performance; in others, purely spatial models are sufficient (Xu et al., 23 Jan 2026).
7. Interpretative Implications and Model Comparison
A central interpretative feature of spectral SPHAR is harmonic-domain sparsity: only a minority of spherical harmonics contribute to space-time dependence (multipolar selection). This regularization guards against overfitting high-frequency noise and achieves dimensionality reduction (). The SSAR class leverages optimal transport geometry, extending autoregressive concepts to intrinsic spherical domains, and provides a unified theory for estimation and inference in both finite- and infinite-dimensional settings.
Ignoring spherical geometry in model specification—i.e., by “Euclideanizing” data—induces substantial prediction errors and inefficient inference. Spherical SAR frameworks adapt both kernel construction and spatial regression principles to the manifold context, combining computational feasibility with comprehensive theoretical guarantees (Caponera et al., 2019, Caponera et al., 2021, Xu et al., 23 Jan 2026).