Papers
Topics
Authors
Recent
Search
2000 character limit reached

Statistical Jet Bundle: Information Geometry

Updated 21 November 2025
  • The statistical jet bundle is a geometric structure that organizes higher-order differential information and hierarchies of variance bounds in statistical models.
  • It employs jet bundles, contact geometry, and Cartan distributions to derive curvature corrections and integrability conditions for estimator efficiency.
  • The framework finds applications in both information geometry and collider phenomenology, enabling improved background rejection and measurement precision via ensemble jet analysis.

A statistical jet bundle is a geometric structure that systematically organizes higher-order differential information about statistical models, particularly encoding the hierarchy of variance bounds—including the Cramér–Rao and Bhattacharyya-type inequalities—within the framework of jet bundles, contact geometry, and Cartan distributions. Introduced and developed in the context of information geometry, the statistical jet bundle formalism provides a unified, intrinsic, and coordinate-free foundation for analyzing estimator efficiency, curvature corrections to variance lower bounds, and the geometric and differential-algebraic criteria for optimality of statistical estimators (Krishnan, 19 Nov 2025).

1. The Statistical Bundle and Jet Bundles

The foundational object is the statistical bundle E=Θ×H=R×L2(μ)E = \Theta \times H = \mathbb{R} \times L^2(\mu), where ΘR\Theta \cong \mathbb{R} is the parameter space with coordinate θ\theta, and the fibre over each θ\theta is the Hilbert space H=L2(μ)H = L^2(\mu) of square-integrable functions on the sample space. A section of EE is a mapping s:ΘHs: \Theta \to H, such as the square-root embedding sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)} associated with a parametric family of densities.

The mm-th order statistical jet bundle Jm(E)J^m(E) over ΘR\Theta \cong \mathbb{R}0 consists, at each ΘR\Theta \cong \mathbb{R}1, of all ΘR\Theta \cong \mathbb{R}2-jets (equivalence classes of local sections matching up to ΘR\Theta \cong \mathbb{R}3 derivatives at ΘR\Theta \cong \mathbb{R}4). In coordinates, an element of ΘR\Theta \cong \mathbb{R}5 is represented as ΘR\Theta \cong \mathbb{R}6, where each ΘR\Theta \cong \mathbb{R}7 denotes the ΘR\Theta \cong \mathbb{R}8-th derivative ΘR\Theta \cong \mathbb{R}9. Natural projections θ\theta0 forget higher derivatives, forming a tower of bundles.

2. Canonical Contact Forms and Total Derivatives

On the trivial finite-dimensional jet bundle θ\theta1, the standard contact 1-forms are

θ\theta2

These forms vanish along holonomic prolongations, i.e., lifts of genuine sections and their derivatives.

On the infinite-dimensional statistical jet bundle θ\theta3, θ\theta4 retains its form, with θ\theta5 interpreted as θ\theta6-valued and θ\theta7 as θ\theta8-valued 1-forms. For each θ\theta9 in the sample space, the scalar evaluation θ\theta0 matches the classical contact form at θ\theta1.

The total derivative, or Cartan vector field, on θ\theta2 is given by

θ\theta3

This operator generates the rank-1 Cartan distribution by annihilation of the contact forms.

3. Cartan Distribution, Ehresmann Connection, Torsion, and Curvature

The Cartan distribution θ\theta4 is a rank-1 distribution on θ\theta5, spanned by θ\theta6. For the bundle projection θ\theta7, the vertical bundle θ\theta8. The connection is encoded by requiring the contact 1-form θ\theta9 to vanish on horizontal vectors, yielding a decomposition H=L2(μ)H = L^2(\mu)0, with the horizontal component H=L2(μ)H = L^2(\mu)1 the kernel of H=L2(μ)H = L^2(\mu)2.

The torsion 1-form is calculated as

H=L2(μ)H = L^2(\mu)3

while the curvature 2-form is H=L2(μ)H = L^2(\mu)4. Non-zero curvature measures the non-integrability of H=L2(μ)H = L^2(\mu)5, representing the geometric source of curvature corrections in estimator variance bounds.

4. Efficient Models and ODE Submanifolds

A statistical model is termed “efficient up to order H=L2(μ)H = L^2(\mu)6” if the estimator residual function lies in the span of the first H=L2(μ)H = L^2(\mu)7 derivatives H=L2(μ)H = L^2(\mu)8, H=L2(μ)H = L^2(\mu)9. Equivalently, there exist coefficients EE0, not all zero, such that for all EE1 sample space,

EE2

The submanifold EE3, defined by the linear constraint EE4, specifies the locus of EE5-th order efficient models. The image of the EE6-jet prolongation EE7 must lie entirely in EE8 for all EE9.

5. Integrability, Variance Bounds, and Curvature Corrections

Classical information inequalities, such as the Cramér–Rao bound (CRB) and Bhattacharyya-type bounds, are re-expressed geometrically in the jet bundle formalism. An unbiased estimator s:ΘHs: \Theta \to H0 achieves s:ΘHs: \Theta \to H1-th order efficiency if and only if the residual s:ΘHs: \Theta \to H2 lies in the span of the first s:ΘHs: \Theta \to H3 derivatives of s:ΘHs: \Theta \to H4. This is algebraically equivalent to the assertion that s:ΘHs: \Theta \to H5 satisfies a homogeneous linear ODE of order s:ΘHs: \Theta \to H6.

Geometrically, s:ΘHs: \Theta \to H7 must be both contained in s:ΘHs: \Theta \to H8 and tangent to the Cartan distribution, i.e., s:ΘHs: \Theta \to H9, implying that sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}0 is an integral submanifold for sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}1. Non-integrability, measured by torsion or curvature, precisely quantifies the amount by which an estimator fails to achieve the refined bound, and the geometric structure supplies the necessary extrinsic corrections to the variance.

For sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}2, the second fundamental form of the embedding sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}3 is

sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}4

and the variance of an unbiased estimator sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}5 satisfies

sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}6

where the first term is the inverse Fisher information and the second term arises from curvature. For sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}7, higher-order “fundamental forms” yield Bhattacharyya-type corrections, and integrability of multiple intersecting efficiency ODEs signals the vanishing of all residual torsion/curvature, corresponding to full sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}8-th order efficiency.

6. Jet Bundle Formalism in Collider Phenomenology

In collider phenomenology, a “statistical jet bundle” denotes the construction in which each collision event’s jet is represented by an ensemble of clustering trees, obtained by randomizing the clustering sequence according to probabilistic weights (as in the Q-jets formalism) (Ellis et al., 2012). Each jet thus gives rise to a bundle of trees over the same event, enabling one to empirically study the distribution of any observable sθ(x)=f(x;θ)s_\theta(x) = \sqrt{f(x;\theta)}9 across the ensemble.

For a given event, the Q-jets procedure randomizes the recombination of constituent four-vectors according to a parameter mm0 (“rigidity”), constructing multiple trees per jet. Observables are collected over this ensemble, defining the empirical distribution mm1 and associated summary statistics—mean, variance, and higher moments.

The width (variance) of these distributions serves as a powerful new discriminant: signal jets (e.g., boosted mm2) typically exhibit narrow mass distributions, while QCD jets display broad volatility. Application of volatility cuts yields significant improvements to signal significance and statistical efficiency, reducing required integrated luminosity by up to a factor of two for boosted-object searches, as detailed in (Ellis et al., 2012).

7. Unifying Principles and Significance

The statistical jet bundle provides a rigorous language for encoding estimator efficiency, variance bounds, and curvature corrections in a single geometric hierarchy. In the information geometry context, the data required for the statement “variance mm3 1/Fisher + curvature correction” is summarized as “the mm4-th prolonged section mm5 is an integral curve of the Cartan distribution restricted to the ODE submanifold mm6.” The jet bundle formalism links algebraic projection conditions and geometric integrability in a unified framework, offering new conceptual and technical insights into the geometry of statistical estimation (Krishnan, 19 Nov 2025).

In collider analysis, the statistical jet bundle (in the Q-jets sense) enables the exploration of the statistical properties of jet observables at the ensemble level, fostering enhanced stability of background rejection and improved measurement precision (Ellis et al., 2012). This dual usage underscores the unifying power of jet bundle formalism in both information geometry and physical data analysis contexts.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Statistical Jet Bundle.