Papers
Topics
Authors
Recent
Search
2000 character limit reached

Closed-Form Geodesics in Matrix Manifolds

Updated 5 February 2026
  • Closed-form geodesics are explicit analytical solutions for distance‐minimizing curves on Riemannian manifolds, particularly in matrix manifolds like the Stiefel, Grassmann, and flag manifolds.
  • They employ a two-parameter family of Riemannian metrics and offer both block-exponential and reduced formulations to efficiently compute geodesic paths, exponential maps, and logarithm maps.
  • The framework integrates Fréchet derivatives with trust-region minimization, yielding robust convergence in optimization and statistical applications on low-dimensional settings.

Closed-form geodesics are explicit analytical solutions for geodesics—locally distance-minimizing curves—on Riemannian manifolds, with particular focus on matrix manifolds such as the Stiefel, Grassmann, and flag manifolds. These formulas enable efficient computation of geodesics, exponential and logarithm maps for a two-parameter family of Riemannian metrics, crucial for algorithms in optimization and statistics on these spaces. The development of closed-form geodesics, together with Fréchet derivatives and trust-region minimization, forms a unified framework for calculating Riemannian logarithms and geodesic distances even when closed-form logarithm maps are unavailable (Nguyen, 2021).

1. Stiefel Manifold and Metric Family

The Stiefel manifold StK(n,p)={YKn×p:YtY=Ip}St_K(n,p) = \{ Y \in K^{n \times p} : Y^t Y = I_p \} consists of orthonormal pp-frames in KnK^n (real or complex). The tangent space at YY comprises matrices η\eta such that YtηY^t \eta is antisymmetric. The Riemannian metric is parameterized by two positive scalars α0,α1\alpha_0, \alpha_1: $g_Y(\omega, \xi) = \Tr_R[ \omega^t (\alpha_0 I_n + (\alpha_1 - \alpha_0) Y Y^t) \xi ]$ The metric interpolates between the embedded metric (α1=α0\alpha_1 = \alpha_0) and canonical metric (α1=12α0\alpha_1 = \frac12 \alpha_0). The associated raising operator has the form

gY(ω)=α0ω+(α1α0)YYtωg_Y(\omega) = \alpha_0 \omega + (\alpha_1-\alpha_0) Y Y^t \omega

This parameterization enables closed-form geodesic solutions that work efficiently for low-rank settings (pnp \ll n) (Nguyen, 2021).

2. Geodesic ODE and Closed-Form Solutions

The geodesic Y(t)Y(t) satisfies the second-order ODE: Y¨+YY˙tY˙+2(1α)(IYYt)(Y˙Y˙t)Y=0,α=α1α0\ddot Y + Y\,\dot Y^t \dot Y + 2(1-\alpha)\,(I - Y Y^t)\,(\dot Y\,\dot Y^t)\,Y = 0, \quad \alpha = \frac{\alpha_1}{\alpha_0} Initial conditions: Y(0)=YY(0) = Y, Y˙(0)=η\dot Y(0) = \eta, and the decomposition: A=Ytη,(IYYt)η=QR,QtQ=I, QtY=0, RKp×kA = Y^t \eta, \qquad (I - Y Y^t)\eta = Q R,\quad Q^t Q = I,\ Q^t Y = 0,\ R \in K^{p \times k} With S0=ηtη=A2+RtRS_0=\eta^t\eta = -A^2 + R^t R and matrix

A^=[2αARt R0]\hat A = \begin{bmatrix} 2\alpha A & -R^t \ R & 0 \end{bmatrix}

the geodesic admits two equivalent closed forms:

  • Block-exponential (dimension (p+k)(p+k)):

Y(t)=[Y Q]exp(tA^)[Ip 0]exp((12α)tA)Y(t) = [Y\ Q] \exp(t \hat A) \begin{bmatrix} I_p \ 0 \end{bmatrix} \exp( (1 - 2\alpha)tA )

  • Reduced (dimension $2p$):

Y(t)=(YM(t)+QN(t)) exp((12α)tA)Y(t) = (Y M(t) + Q N(t))\ \exp( (1 - 2\alpha)tA )

with

[M(t) N(t)]=exp(t[2αARt R0])[Ip 0][M(t)\ N(t)] = \exp \left( t \begin{bmatrix} 2\alpha A & -R^t \ R & 0 \end{bmatrix} \right) \begin{bmatrix} I_p \ 0 \end{bmatrix}

Both representations maintain computational complexity at O((p+k)3)O((p+k)^3) per matrix exponential, avoiding dependence on the ambient dimension nn when kpk \ll p (Nguyen, 2021).

3. Fréchet Derivatives and Gradient Computation

For the Riemannian logarithm, the problem reduces to finding (A,R)(A,R) such that Y(1)ZY(1) \approx Z, minimizing

F(A,R)=12Y(A,R;1)ZF2F(A, R) = \frac12 \| Y(A, R;1) - Z \|_F^2

The gradient is obtained via Fréchet derivatives: Lf(A,E)=limh0f(A+hE)f(A)h=i=1fia+b=i1AaEAb\mathcal L_f(A, E) = \lim_{h \to 0} \frac{f(A + hE) - f(A)}{h} = \sum_{i=1}^\infty f_i \sum_{a+b=i-1} A^a E A^b This yields gradients with computational cost comparable to evaluating one exponential plus its Fréchet derivative (approximately three times the cost of a single exponential). The trace–Fréchet adjoint identity is applied: $\Tr[ C\,\mathcal L_f(A, E)\,D ] = \Tr[ \mathcal L_f(A, D C)\,E ]$ The gradient splits into blocks: AF=(12α)skew(L˚)+2αskew(L^11),RF=L^21L^12t\nabla_A F = (1-2\alpha)\,\mathrm{skew}(\mathring L) + 2\alpha\,\mathrm{skew}(\hat L_{11}),\qquad \nabla_R F = \hat L_{21} - \hat L_{12}^t with skew(M)=12(MMt)\mathrm{skew}(M) = \frac12(M-M^t), enabling efficient exact gradient evaluation (Nguyen, 2021).

4. Trust-Region Minimization and the Riemannian Logarithm

The Riemannian logarithm is computed by minimizing F(A,R)F(A, R) over antisymmetric AA and arbitrary RR. Standard interior-point or trust-region solvers (e.g., SciPy’s “trust-krylov”) are employed, leveraging the closed-form gradient. When ZZ is within the injectivity radius of YY, the Hessian of FF is positive definite and convergence to the unique minimizing logarithm is rapid. For moderately large distances, the algorithm often still recovers nearly minimal “shooting” geodesics.

This approach extends to quotient manifolds:

  • On St(n,p)/HSt(n,p)/H, for a subgroup HO(p)H \subset O(p) (e.g., Grassmann or flag), restrict AA to the horizontal subspace and adapt cost functions to measure ambient-coordinate distance.
  • The trust-region and Fréchet framework applies identically for these cases (Nguyen, 2021).

5. Grassmann and Flag Manifolds: Trigonometric Formulas

For the Grassmann manifold Gr(p,n)=St(n,p)/O(p)Gr(p, n) = St(n, p) / O(p), the horizontal space is {η:Ytη=0}\{\eta : Y^t\eta = 0\}. For a purely horizontal geodesic (A=0A=0): $\Exp^{Gr}_Y(\eta) = \Exp^{St}_Y(\eta) = Y\,\cos(S^{1/2}) + \eta\,S^{-1/2} \sin(S^{1/2}), \quad S = \eta^t\eta$ Given Y,ZSt(n,p)Y,Z \in St(n,p), compute the SVD: YtZ=UΣVt,Σ=diag(σi)[0,1]p×pY^t Z = U\,\Sigma\,V^t, \quad \Sigma = \mathrm{diag}(\sigma_i) \in [0,1]^{p \times p} The unique minimizing logarithm for all σi(0,1)\sigma_i \in (0,1): $\Log_Y^{Gr}(Z) = (Z - Y Y^t Z)\; V\; (I - \Sigma^2)^{-1/2} \arccos(\Sigma) U^t$ The geodesic distance is

$d_{Gr}(Y, Z) = \sqrt{\alpha_0\,\Tr[\arccos^2(\Sigma)]}$

These are analogous to the cosθ\cos\theta, sinθ\sin\theta formulas on the sphere (Nguyen, 2021).

6. Algorithmic Complexity, Stability, and Applications

Each trust-region iteration requires one or two matrix exponentials (size at most (p+k)(p+k)) plus Fréchet derivatives; cost per exponential is O((p+k)3)O((p+k)^3). For low-rank problems (pnp \ll n), all computations occur in small dimension (kpk \le p). Fréchet-derivative routines (e.g., SciPy's expm_frechet) compute both the exponential and its Fréchet derivative at threefold the cost of the exponential alone.

The approach avoids integration along curves or multi-step shooting, instead solving a small-dimensional nonlinear least-squares problem. Numerical experiments on Stiefel and flag manifolds demonstrate robust convergence for distances up to and beyond half the injectivity radius. The Hessian remains well-conditioned for points near one another, deteriorating only as one approaches the cut locus.

One immediate application is computation of the Riemannian center of mass (Karcher mean): minimizing 1Nid2(X,Qi)\frac1N \sum_i d^2(X, Q_i) by gradient descent, where each step requires NN logarithm computations—now feasible in low-dimension settings. The methods are conceptually and algorithmically unified, supporting practical statistical and optimization tasks on matrix manifolds (Nguyen, 2021).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Closed-Form Geodesics.