Papers
Topics
Authors
Recent
Search
2000 character limit reached

Forsythe Conjecture & Restarted Krylov Methods

Updated 18 February 2026
  • Forsythe Conjecture is a hypothesis on the asymptotic behavior of restarted Krylov methods, focusing on convergence patterns of normalized residuals.
  • Modern interpretations leverage Krylov subspace and Arnoldi projections to extend the conjecture to symmetric, orthogonal, and nonsymmetric matrices.
  • The conjecture remains largely open for restart lengths s ≥ 2, presenting challenges in proving convergence uniqueness and characterizing limit sets.

The Forsythe Conjecture concerns the asymptotic behavior of restarted Krylov subspace methods, specifically the conjugate gradient (CG) method, for solving linear systems or minimizing quadratic forms. Originally formulated by G.E. Forsythe in 1968 for symmetric positive-definite (SPD) matrices, it hypothesizes strict regularities in the convergence pattern of the normalized residuals under periodic CG restarting, with significant implications for the structure and performance of iterative solvers. Modern developments reinterpret and generalize the conjecture to cover symmetric, orthogonal, and nonsymmetric matrices via advanced Krylov-based and Arnoldi-type projection frameworks (Faber et al., 2022).

1. Original Formulation and Historical Context

Forsythe's initial setting was the minimization of f(x)=12xTAxxTbf(x)=\frac{1}{2}x^{\mathrm{T}}A x - x^{\mathrm{T}}b with ARn×nA \in \mathbb{R}^{n \times n} symmetric positive definite and bRnb \in \mathbb{R}^n, corresponding to solving Ax=bAx = b via the CG method. He introduced periodic restarting of CG every ss steps, known as the optimal ss-gradient method. Iterates xkx_k and residuals rk=bAxkr_k = b - Ax_k are generated, with yk=rk/rky_k = r_k/\|r_k\|.

For s=1s=1, Forsythe established rigorous convergence: the normalized residuals {yk}\{y_k\} alternate between two limiting directions. For s=2s=2, numerical evidence showed a similar two-directional limit cycle but a general proof was absent. The conjecture posits that for 2s<d(A)2 \leq s < d(A), where d(A)d(A) is the minimal polynomial degree of AA, both subsequences {y2k}\{y_{2k}\} and {y2k+1}\{y_{2k+1}\} each converge to a single unit vector. This assertion remains largely unproven except for certain low values of ss and specific matrix classes (Faber et al., 2022).

2. Modern Krylov-Subspace Interpretation

The conjecture has been translated into Krylov-subspace language, aligning with contemporary understanding of iterative solvers. The restarted CG process is recast as follows: at each restart, xk+1x_{k+1} is the AA-inner-product orthogonal projection of the true solution onto the affine subspace xk+Ks(A,rk)x_k + \mathcal{K}_s(A, r_k), where Ks(A,rk)\mathcal{K}_s(A, r_k) is the Krylov subspace of order ss generated by AA and rkr_k. Explicitly,

xk+1=argminzxk+Ks(A,rk)xzAx_{k+1} = \arg\min_{z \in x_k + \mathcal{K}_s(A, r_k)} \|x - z\|_A

with corresponding residuals and normalized directions.

The modern Forsythe conjecture (for symmetric AA) states: given 1s<d(A)1 \leq s < d(A) and initial x0x_0 such that d(A,r0)s+1d(A, r_0) \geq s + 1, the sequences {y2k}\{y_{2k}\} and {y2k+1}\{y_{2k+1}\} converge to unit vectors yeveny_{\text{even}} and yoddy_{\text{odd}} respectively as kk \to \infty, so the residual directions asymptotically oscillate between two vectors (Faber et al., 2022).

3. Generalization via Arnoldi Cross Iteration

Extending beyond SPD matrices, the conjecture has been generalized using projections inspired by the Arnoldi (and for symmetric cases, the Lanczos) algorithm. For arbitrary ARn×nA \in \mathbb{R}^{n \times n} and vv with d(A,v)>sd(A, v) > s, the Arnoldi projection Ps(A;v)P_s(A; v) yields w=q(A)vw = q(A)v where qMsq \in \mathcal{M}_s is a monic polynomial of degree ss minimizing norm, and ww is orthogonal to Ks(A,v)\mathcal{K}_s(A, v).

Algorithmically, the Arnoldi Cross Iteration with restart ss ("ACI(s)") alternates projections with AA and ATA^\mathrm{T}:

  1. wk=Ps(A;vk)vk/Ps(A;vk)vkw_k = P_s(A; v_k) v_k / \|P_s(A; v_k) v_k\|
  2. vk+1=Ps(AT;wk)wk/Ps(AT;wk)wkv_{k+1} = P_s(A^\mathrm{T}; w_k) w_k / \|P_s(A^\mathrm{T}; w_k) w_k\|

For symmetric AA this reduces to a Lanczos-type projection sequence on the sphere. The generalized conjecture claims that, under the same degree condition, the sequence {vk}\{v_k\} converges to a single limit vector even in nonsymmetric settings, although results for s2s \geq 2 remain largely open (Faber et al., 2022).

4. Theoretical Results and Limit Set Structure

Key verified theoretical results concerning ACI(s) and the Forsythe conjecture include:

  • Monotonicity and Bound (Theorem 2.3): All vectors in ACI(s) sequences are well defined, and wkvk+1wk+1vk+2Φs(A)\|w_k\| \leq \|v_{k+1}\| \leq \|w_{k+1}\| \leq \|v_{k+2}\| \leq \Phi_s(A) for all k0k \geq 0, with Φs(A)=maxv=1minqMsq(A)v\Phi_s(A) = \max_{\|v\|=1} \min_{q \in \mathcal{M}_s} \|q(A)v\|.
  • Vanishing Differences (Theorem 2.5): wk+1wk0\|w_{k+1} - w_k\| \to 0 and vk+1vk0\|v_{k+1} - v_k\| \to 0 as kk \to \infty.
  • Limit Set Properties (Theorem 3.8): The set ΣA\Sigma_*^A of all limit points of {vk}\{v_k\} is nonempty, closed, and connected on the unit sphere. Every limit vv_* must satisfy v=TAT(TA(v))v_* = T_{A^\mathrm{T}}(T_A(v_*)), where TA(v)=Ps(A;v)v/Ps(A;v)vT_A(v) = P_s(A; v) v / \|P_s(A; v) v\|.

For symmetric AA: - If d(A,v0)=s+1d(A,v_0) = s+1, then v0=v2=v4=v_0 = v_2 = v_4 = \ldots (Theorem 4.2). - Every limit vector vv_* satisfies (Q2s(A;v)τ2I)v=0(Q_{2s}(A; v_*) - \tau^2 I) v_* = 0 for s<d(A,v)2ss < d(A, v_*) \leq 2s (Theorem 4.4), an interpolation relation. - For s=1s=1, the sequence {v2k}\{v_{2k}\} converges uniquely (Theorem 4.7).

For orthogonal AA with s=1s=1 and 0F(A)0 \notin F(A), any limit vv_* has grade 2 and corresponds to the real part of a conjugate-pair of eigenvalues (Lemma 4.9), and the full sequence converges under mild conditions (Theorem 4.12) (Faber et al., 2022).

5. Open Problems and Research Directions

Despite advances for the s=1s=1 case, key challenges persist, particularly for higher restart lengths (s2s \geq 2):

  • For s2s \geq 2 and symmetric AA, uniqueness of the limit point is unproved in general. Previous arguments for s=2s=2 rely on a coefficient-convergence result attributed to Zabolotskaya (1979), whose full rigor remains controversial (Faber et al., 2022).
  • For nonsymmetric ACI(s), convergence beyond s=1s=1 is open, due in part to lack of simple monotonicity in norm and the possibility of higher-dimensional limit sets.
  • A promising avenue combines finite enumeration of candidate limit polynomials (of degree at most $2s$) with proof that coefficient convergence implies vector convergence.
  • Connections to two-sided Rayleigh quotient and Alternating Rayleigh Quotient Iteration (ARQI) methodologies may provide new monotonicity or contraction arguments.

A plausible implication is that resolving these open questions would significantly advance understanding of the limiting dynamics of restarted Krylov methods.

6. Current State of Knowledge

The state of results on the Forsythe conjecture and its generalizations can be summarized as follows:

Case Status Notes/Provenance
s=1s=1, A=ATA = A^\mathrm{T} (symmetric) Proven (unique 2-cycle) Akaike (1959), Afanasjew et al. (2008), (Faber et al., 2022)
s=1s=1, AA orthogonal, 0F(A)0 \notin F(A) Proven (full convergence) (Faber et al., 2022)
s2s\geq 2, symmetric AA Largely open Evidence for s=2s=2, full proof lacking
s2s\geq 2, nonsymmetric AA (ACI(s)) Open Limit set structure may be higher-dimensional

Although the case s=1s=1 is now rigorously understood in both SPD and some orthogonal settings, the Forsythe conjecture for s2s \geq 2 remains a central open problem, whose resolution likely requires novel insights into the algebraic structure and convergence properties of Krylov-projection polynomials and their limits (Faber et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Forsythe Conjecture.