Papers
Topics
Authors
Recent
Search
2000 character limit reached

Orthant Normal Distribution

Updated 1 January 2026
  • Orthant normal distribution is defined as the probability that all components of a multivariate Gaussian are nonnegative, with applications in Gaussian process theory and polyhedral geometry.
  • Specialized techniques like Steck’s reduction for equicorrelated structures and asymptotic bounds provide efficient estimation methods for these multidimensional probabilities.
  • The holonomic gradient method reformulates the problem as a system of ODEs, offering enhanced numerical stability and competitive accuracy for high-dimensional integration challenges.

An orthant normal distribution is a classical but highly nontrivial object in the study of multivariate Gaussian measures: it refers to the computation of the probability that a multivariate normal vector falls within a specified orthant, most often the non-negative orthant, i.e., the set {xRn:x10,,xn0}\{x\in\mathbb{R}^n:x_1\ge0,\dots,x_n\ge0\}. The study of such probabilities is essential in multivariate probability, Gaussian process theory, polyhedral geometry, and has nontrivial applications in random polynomials on the simplex. For equicorrelated covariance structures, this problem admits specialized reductions and asymptotic analysis, while modern computation exploits connections to holonomic systems.

1. Formal Definition and Integral Representation

Let X=(X1,,Xn)Nn(0,Σ)X=(X_1,\dots,X_n)\sim N_n(0,\Sigma), where every XiX_i is real-valued and Σ\Sigma is a symmetric positive-definite covariance matrix. The orthant probability for the non-negative orthant is

f(n,Σ)=P{X10,,Xn0}=[0,)nϕn(x;0,Σ)dxf(n,\Sigma) = P\{X_1\ge0,\ldots,X_n\ge0\} = \int_{[0,\infty)^n} \phi_n(x;0,\Sigma)\,dx

with multivariate density

ϕn(x;0,Σ)=(2π)n/2det(Σ)1/2exp(12xTΣ1x).\phi_n(x;0,\Sigma) = (2\pi)^{-n/2}\det(\Sigma)^{-1/2}\exp\left(-\frac{1}{2}x^T\Sigma^{-1}x\right).

For the equicorrelated case Σij=1\Sigma_{ij} = 1 for i=ji = j and Σij=ρ(0,1)\Sigma_{ij} = \rho\in(0,1) for iji\ne j, the orthant probability admits a one-dimensional reduction due to Steck (1962): f(n,ρ)=E[Φn(Zs)]f(n,\rho) = \mathbb{E}[\Phi^n(Z\sqrt{s})] where s=ρ/(1ρ)s=\rho/(1-\rho), ZN(0,1)Z\sim N(0,1), and Φ\Phi is the univariate normal CDF (Pinasco et al., 2020). More generally,

Φn(μ,Σ)=P(X10,,Xn0)=[0,)nϕn(x;μ,Σ)dx\Phi_n(\mu,\Sigma)=P(X_1\ge0,\dots,X_n\ge0) = \int_{[0,\infty)^n} \phi_n(x;\mu,\Sigma)\,dx

with extensions allowing arbitrary mean and covariance (Koyama et al., 2012).

2. Analytical Bounds for the Equicorrelated Orthant Probability

For the equicorrelated normal, f(n,ρ)f(n, \rho) displays distinct asymptotic regimes depending on ρ\rho:

  • For ρ>1/2\rho > 1/2 and n2n\ge2: There exists a universal constant c1>0c_1>0 such that

c12+(1/ρ1)lognf(n,ρ)n11/ρ1221/ρ1ρρ(1+B(2,1/ρ1))\frac{c_1}{\sqrt{2+(1/\rho-1)\log n}} \le \frac{f(n,\rho)}{n^{1-1/\rho}} \le \frac{1}{2^{2-1/\rho}\sqrt{\frac{1-\rho}{\rho}(1+B(2,1/\rho-1))}}

with B(a,b)=Γ(a)Γ(b)/Γ(a+b)B(a,b) = \Gamma(a)\Gamma(b)/\Gamma(a+b) the Beta function.

  • For ρ<1/2\rho < 1/2 and suitably large nn:

1221/ρ1ρρ(Γ(1/ρ1)1)f(n,ρ)n11/ρ\frac{1}{2^{2-1/\rho}\sqrt{\frac{1-\rho}{\rho}(\Gamma(1/\rho-1)-1)}} \le \frac{f(n,\rho)}{n^{1-1/\rho}}

and for all nn0(ρ)n\ge n_0(\rho),

f(n,ρ)n11/ρ1ρρ[(1/ρ1)log2n]1/ρ2\frac{f(n,\rho)}{n^{1-1/\rho}} \le \sqrt{\frac{1-\rho}{\rho}[(1/\rho-1)\log^2 n]^{1/\rho-2}}

For all fixed ρ(0,1)\rho\in(0,1), f(n,ρ)n11/ρf(n,\rho) \asymp n^{1-1/\rho} up to polylogarithmic factors in nn (Pinasco et al., 2020).

Steck’s reduction, Markov inequalities, and Mill’s ratio-type bounds underpin these estimates, with critical refinements available for the logarithmic correction regimes in nn.

3. Holonomic Gradient Method and Pfaffian Systems

Direct computation of orthant probabilities for arbitrary covariance matrices is numerically challenging. The holonomic gradient method (HGM) establishes a system of linear PDEs—the holonomic system—annihilating the multivariate integral

g(x,y)=[0,)nexp(12i,j=1nxijtitj+i=1nyiti)dt1dtn,g(x,y) = \int_{[0,\infty)^n} \exp\left(\frac{1}{2}\sum_{i,j=1}^n x_{ij} t_i t_j + \sum_{i=1}^n y_i t_i\right) dt_1\cdots dt_n,

where parameters are x=Σ1x=-\Sigma^{-1} and y=Σ1μy=\Sigma^{-1}\mu (Koyama et al., 2012). The corresponding orthant probability is then

Φn(μ,Σ)=(2π)n/2Σ1/2exp(12μTΣ1μ)g(x,y).\Phi_n(\mu,\Sigma) = (2\pi)^{-n/2}|\Sigma|^{-1/2}\exp\left(\frac{1}{2}\mu^T\Sigma^{-1}\mu\right)g(x,y).

The HGM reformulates the problem as a first-order Pfaffian ODE for the vector G(x,y)={gJ(x,y):J{1,,n}}G(x,y) = \{g_J(x,y): J\subset\{1,\ldots,n\}\} of 2n2^n components, with recurrence relations for derivatives in xijx_{ij} and yiy_i. This reduction ensures that the numerical procedure is well-conditioned as long as the covariance remains positive definite.

The algorithm iteratively integrates from a diagonal start (zero mean, diagonal covariance) to the desired (xtarget,ytarget)(x_{\mathrm{target}}, y_{\mathrm{target}}) along a straight path, solving the ODE with a method such as fourth-order Runge–Kutta. The principal computational cost stems from the exponential state scaling, O(n22nNsteps)O(n^2 2^n N_\text{steps}), but practical calculations are reported up to n=50n=50 under moderate accuracy (Koyama et al., 2012).

4. Plackett Recurrence and Comparative Numerical Strategies

Classical approaches, such as Plackett's 1954 recurrence, relate derivatives of orthant probabilities to lower-dimensional analogs through explicit recurrence relations. However, the recursion involves denominators 1ρij2\sqrt{1-\rho_{ij}^2}, introducing numerical instability for near-singular correlation matrices or as ρij1|\rho_{ij}| \uparrow 1.

The holonomic gradient method, by contrast, eliminates such problematic factors and operates on rational recurrences over an exponentially-sized state space, yielding improved stability on high-dimensional or ill-conditioned problems. Comparative studies indicate HGM’s competitive or superior computational timing for large nn and improved accuracy benchmarks up to 10910^{-9} (Koyama et al., 2012).

5. Asymptotics and Applications in Random Polynomials on the Simplex

A substantial application concerns random homogeneous polynomials evaluated on the standard nn-simplex Δ\Delta. Specifically, for a random Bombieri-normalized kk-homogeneous polynomial PP, the event that PP attains a (relative) maximum at a vertex aia_i is equivalent to the event that an associated equicorrelated multivariate normal X(ai)X(a_i) has all positive entries. The relevant correlation parameter for vertices is

ρn=nk+(k1)n(k+1)+(k1)kk+1as n.\rho_n = \frac{nk + (k-1)}{n(k+1) + (k-1)} \to \frac{k}{k+1} \quad \text{as } n\to\infty.

For k>4k>4, the events {P has max at ai}\{P \text{ has max at } a_i\} become almost independent in total variation as nn\to\infty, and the probability that PP has a maximum at some vertex converges to $1$ (Pinasco et al., 2020).

6. Numerical Implementation and Performance

The HGM algorithm operates through an initial parameter transformation to (x,y)(x, y), initialization on a diagonal covariance, ODE setup via block-sparse recurrences, and numerical integration along a chosen path. Key implementation items include:

  • Sparse matrix optimization given that each recurrence couples subsets differing by at most two elements.
  • Precomputation of recurrence sparsity patterns for efficiency.
  • Monitoring of covariance positivity to stay within the holonomic domain.

Numerical experiments reported by Koyama & Takemura show high-precision agreement with exact results (up to 10910^{-9}), competitive runtimes for small nn compared to Miwa–Hayter–Kuriki recursion, and clear acceleration for large nn (e.g., HGM is faster for n=20,50n=20,50). Limitations can occur when μ\|\mu\| is large and Σ\Sigma nearly singular, causing ODE stiffness (Koyama et al., 2012).

7. Connections to Broader Contexts

Orthant normal probabilities underpin various problems in statistical inference, Gaussian process excursion sets, and the study of random fields, as well as in polyhedral geometry and stochastic optimization. Their computation, both by analytic reduction in symmetric (equicorrelated) cases and by holonomic systems in the general setting, remains an active area with substantial algorithmic and theoretical developments (Pinasco et al., 2020, Koyama et al., 2012).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Orthant Normal Distribution.