Papers
Topics
Authors
Recent
Search
2000 character limit reached

Orthogonal Decomposition of the Tangent Space

Updated 1 January 2026
  • Orthogonal decomposition of the tangent space partitions a statistical manifold into a covariate subspace and its orthogonal residual, enabling finite-dimensional Fisher information analysis.
  • The covariate Fisher Information Matrix (cFIM) is derived by restricting the Fisher-Rao metric to the covariate subspace, bridging geometric, probabilistic, and statistical inference.
  • This framework underpins variance bounds, efficient score computations, and intrinsic dimensionality tests, enhancing inference in autoregressive and error-prone regression models.

The Covariate Fisher Information Matrix (cFIM) is a formalism that generalizes and renders tractable the Fisher information in a range of statistical and information-geometric contexts where covariates are either observed with error, endogenous, or serve as the primary locus of meaningful variation. It plays a critical role in nonparametric information geometry, autoregressive modeling, and error-prone regression scenarios, offering both a finite-dimensional representative of information theoretic quantities and a concrete bridge between geometric, probabilistic, and statistical inference frameworks (Cheng et al., 25 Dec 2025, Gao et al., 2017, Heavens et al., 2014).

1. Foundational Definition and Geometric Framework

The cFIM arises from an orthogonal decomposition of the tangent space TfMT_fM of an infinite-dimensional nonparametric statistical manifold MM, where elements are smooth, positive probability densities ff on Rn\mathbb R^n. The tangent space to MM at ff is given by

TfM={hC(Rn):h(x)dx=0}T_fM = \{h \in C^\infty(\mathbb R^n) : \int h(x)\, dx = 0\}

with the Fisher-Rao metric functional

gf(h1,h2)=Rnh1(x)h2(x)f(x)dx.g_f(h_1, h_2) = \int_{\mathbb R^n} \frac{h_1(x) h_2(x)}{f(x)}\, dx.

Choosing observed coordinates x=(x1,...,xn)x = (x_1, ..., x_n), one defines the covariate subspace STfMS \subset T_fM as

S=span{if:i=1,...,n},S = \mathrm{span}\Bigl\{ \partial_i f : i = 1, ..., n \Bigr\},

with if=fxi\partial_i f = \frac{\partial f}{\partial x_i}. This leads to an orthogonal decomposition

TfM=SS,T_fM = S \oplus S^\perp,

where SS^\perp is the residual subspace orthogonal to SS. Every hTfMh \in T_fM has a unique decomposition h=hS+εh = h_S + \varepsilon with hSSh_S \in S, εS\varepsilon \in S^\perp (Cheng et al., 25 Dec 2025).

2. cFIM: Finite-Dimensional Realization and Statistical Properties

Restricting the Fisher-Rao metric to SS, one defines the cFIM GfG_f as

(Gf)ij=gf(if,jf)=EXf[si(X)sj(X)](G_f)_{ij} = g_f(\partial_i f, \partial_j f) = \mathbb E_{X \sim f}[s_i(X) s_j(X)]

where si(x)=ilnf(x)s_i(x) = \partial_i \ln f(x) are the coordinate score functions. In matrix form,

Gf=(Ef[ilnfjlnf])i,j=1n.G_f = \left( \mathbb E_f[\partial_i \ln f \, \partial_j \ln f] \right)_{i,j=1}^n.

Under mild regularity, GfG_f is positive-definite and invertible. It serves as a computable, finite-dimensional approximation to the infinite-dimensional Fisher-Rao metric, capturing the total Fisher information available in the observed covariates.

A key result is the Trace Theorem:

HG(f):=Ef[lnf(X)2]=Tr(Gf),H_G(f) := \mathbb E_f[\| \nabla \ln f(X) \|^2] = \mathrm{Tr}(G_f),

where HG(f)H_G(f), known as G-entropy, quantifies the total explainable information from the observed covariates (Cheng et al., 25 Dec 2025).

3. Connections to KL Divergence and Variance Bounds

The cFIM is linked to the curvature of the Kullback-Leibler divergence:

gf(h,h)=d2dt2t=0DKL(fft)g_f(h, h) = \left.\frac{d^2}{dt^2}\right|_{t=0} D_{\mathrm{KL}}(f \| f_t)

for a smooth path ftf_t with f0=ff_0 = f and tangent f˙tt=0=h\dot f_t|_{t=0} = h. For covariate-direction tangents hi=fsih_i = f s_i,

(Gf)ii=d2dt2t=0DKL(ffi,t)(G_f)_{ii} = \left.\frac{d^2}{dt^2}\right|_{t=0} D_{\mathrm{KL}}(f \| f_{i,t})

and summing over ii yields HG(f)H_G(f).

A nonparametric Covariate Cramér–Rao Lower Bound (CRLB) is established:

AsyCov(θ^)Gf1,\mathrm{AsyCov}(\hat \theta) \succeq G_f^{-1},

providing fundamental variance limits for regular estimators, contingent on geometric alignment (Cheng et al., 25 Dec 2025).

4. Relation to Efficient Fisher Information in Semi-Parametric Models

In a semi-parametric framework with parameter of interest θ=T(f)Rd\theta = T(f) \in \mathbb R^d and nuisance parameter η(f)\eta(f), the efficient Fisher information IeffI_{\mathrm{eff}} is the covariance of the efficient score—orthogonal projection of the full score onto the complement of the nuisance tangent space. Under a Geometric Alignment Postulate where the efficient score coincides with covariate scores,

Gf=Ieff(θ),G_f = I_{\mathrm{eff}}(\theta),

demonstrating that GfG_f provides the relevant information bound for semi-parametric estimation (Cheng et al., 25 Dec 2025).

5. cFIM in Autoregressive and Covariate-Dependent Models

In autoregressive models for time series or regression models with endogenous covariates, the cFIM (also called the exact conditional Fisher information matrix) captures the information properly accounting for lagged input dependence. For an order-pp logistic autoregressive model with or without exogenous covariates, the exact cFIM is

Ic(ϑI0)=t=p+1Tytw(ϕt;ϑ)ϕtϕtQt(yt1,...,ytp),I_c(\vartheta | \mathcal I_0) = \sum_{t=p+1}^T \sum_{y_{-t}} w(\phi_t; \vartheta) \, \phi_t \phi_t^\top Q_t(y_{t-1}, ..., y_{t-p}),

with explicit definitions for ww, ϕt\phi_t, and the lag-block transition kernels QtQ_t. Efficient recursive computation is possible and essential for finite sample inference, yielding variance estimates and confidence intervals that can be narrower and more accurate than empirical approximation, especially when endogenous structure is non-negligible (Gao et al., 2017).

6. cFIM for Data with Measurement Error in Covariates

In regression and forecasting when both coordinates (X,Y)(X, Y) are measured with error, the cFIM formalism marginalizes over latent variables and propagates all error covariances. The marginal likelihood is characterized by an effective covariance

R=CYYCXYTTTTCXY+TCXXTTR = C_{YY} - C_{XY}^T T^T - T C_{XY} + T C_{XX} T^T

where TT is the Jacobian of the model with respect to xx, and the Fisher information for model parameters is expressed solely in terms of RR, extending the Fisher approach beyond regimes where covariate errors can be ignored (Heavens et al., 2014).

7. Intrinsic Dimensionality, the Manifold Hypothesis, and Information Capture

The cFIM framework facilitates rigorous investigation of the Manifold Hypothesis (MH), which posits that data in high-dimensional ambient spaces are supported near a lower-dimensional submanifold. The information-capture subspace is characterized by the rank of GfG_f: rank-deficiency in GfG_f is a testable condition for the MH.

The Information-Capture Ratio—the ratio of the trace of GfG_f attributable to the effective signal coordinates—serves as a quantitative intrinsic dimensionality estimator, directly enabling the transition from heuristic to testable geometric claims about the structure of high-dimensional data (Cheng et al., 25 Dec 2025).


Summary Table: Principal cFIM Formulations

Context cFIM Formula Reference
Information geometry (Gf)ij=Ef[ilnfjlnf](G_f)_{ij} = \mathbb{E}_f[\partial_i\ln f\, \partial_j\ln f] (Cheng et al., 25 Dec 2025)
Logistic autoregression, endogenous input Ic(ϑI0)=w(ϕt;ϑ)ϕtϕtQtI_c(\vartheta|\mathcal{I}_0) = \sum w(\phi_t;\vartheta)\phi_t\phi_t^\top Q_t (Gao et al., 2017)
Regression with measurement errors Fαβ=12Tr[R1R,αR1R,β]+μ,αTR1μ,βF_{\alpha\beta} = \frac{1}{2}\mathrm{Tr}[R^{-1}R_{,\alpha}R^{-1}R_{,\beta}] + \mu_{,\alpha}^T R^{-1} \mu_{,\beta} (Heavens et al., 2014)

The Covariate Fisher Information Matrix thus provides a unifying and operationally tractable platform for modern inference and explainability in scenarios where the role, quality, and structure of the covariates are central to the statistical modeling problem.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (3)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Orthogonal Decomposition of the Tangent Space.