Gradient descent avoids strict saddles with a simple line-search method too

Published 18 Jul 2025 in math.OC, cs.NA, math.DS, and math.NA (arXiv:2507.13804v1)

Abstract: It is known that gradient descent (GD) on a $C^2$ cost function generically avoids strict saddle points when using a small, constant step size. However, no such guarantee existed for GD with a line-search method. We provide one for a modified version of the standard Armijo backtracking method with generic, arbitrarily large initial step size. In contrast to previous works, our analysis does not require a globally Lipschitz gradient. We extend this to the Riemannian setting (RGD), assuming the retraction is real analytic (though the cost function still only needs to be $C^2$). In closing, we also improve guarantees for RGD with a constant step size in some scenarios.
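
For concreteness, here is a minimal Python sketch of the *standard* Armijo backtracking scheme that the paper modifies, applied to a toy cost with a strict saddle. The function name `gd_armijo`, the parameter defaults, and the test cost are illustrative choices for this sketch, not the paper's modified method or notation.

```python
import numpy as np

def gd_armijo(f, grad_f, x0, t0=1.0, beta=0.5, c=1e-4,
              max_iters=1000, tol=1e-8):
    """Gradient descent with standard Armijo backtracking.

    From the current iterate x with gradient g, shrink the trial step
    t by the factor beta until the sufficient-decrease condition
        f(x - t*g) <= f(x) - c*t*||g||^2
    holds, then move to x - t*g. All defaults are illustrative.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iters):
        g = grad_f(x)
        gnorm2 = float(g @ g)
        if gnorm2 < tol**2:          # stop near a critical point
            break
        t, fx = t0, f(x)
        while f(x - t * g) > fx - c * t * gnorm2:
            t *= beta                # backtrack until sufficient decrease
        x = x - t * g
    return x

# Toy example: f(x, y) = x^4/4 - x^2/2 + y^2/2 has a strict saddle at
# the origin (Hessian eigenvalues -1 and +1) and minima at (+-1, 0).
# From a generic start near the saddle, the iterates escape to a minimum.
f = lambda z: z[0]**4 / 4 - z[0]**2 / 2 + z[1]**2 / 2
grad_f = lambda z: np.array([z[0]**3 - z[0], z[1]])
print(gd_armijo(f, grad_f, [1e-3, 0.5]))   # approximately [1., 0.]
```

Note that this sketch restarts the backtracking from the fixed initial trial step `t0` at every iteration; the paper's contribution concerns a modified variant of this scheme, with a generic, arbitrarily large initial step size and without assuming a globally Lipschitz gradient.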
