Papers
Topics
Authors
Recent
Search
2000 character limit reached

Convergence Rates of Stochastic Zeroth-order Gradient Descent for Ł ojasiewicz Functions

Published 31 Oct 2022 in math.OC and cs.LG | (2210.16997v6)

Abstract: We prove convergence rates of Stochastic Zeroth-order Gradient Descent (SZGD) algorithms for Lojasiewicz functions. The SZGD algorithm iterates as \begin{align*} \mathbf{x}{t+1} = \mathbf{x}_t - \eta_t \widehat{\nabla} f (\mathbf{x}_t), \qquad t = 0,1,2,3,\cdots , \end{align*} where $f$ is the objective function that satisfies the \L ojasiewicz inequality with \L ojasiewicz exponent $\theta$, $\eta_t$ is the step size (learning rate), and $ \widehat{\nabla} f (\mathbf{x}_t) $ is the approximate gradient estimated using zeroth-order information only. Our results show that $ { f (\mathbf{x}_t) - f (\mathbf{x}\infty) }{t \in \mathbb{N} } $ can converge faster than $ { | \mathbf{x}_t - \mathbf{x}\infty | }_{t \in \mathbb{N} }$, regardless of whether the objective $f$ is smooth or nonsmooth.

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (2)

Collections

Sign up for free to add this paper to one or more collections.