Skip to main content

Third-order smoothness helps: Faster stochastic optimization algorithms for finding local minima

Publication ,  Conference
Yu, Y; Xu, P; Gu, Q
Published in: Advances in Neural Information Processing Systems
January 1, 2018

We propose stochastic optimization algorithms that can find local minima faster than existing algorithms for nonconvex optimization problems, by exploiting the third-order smoothness to escape non-degenerate saddle points more efficiently. More specifically, the proposed algorithm only needs Oe(ℰ-10/3) stochastic gradient evaluations to converge to an approximate local minimum x, which satisfies krf(x)k2 ≤ℰ and λmin(r2f(x)) ≥ -pℰ in unconstrained stochastic optimization, where Oe(·) hides logarithm polynomial terms and constants. This improves upon the Oe(ℰ-7/2) gradient complexity achieved by the state-of-the-art stochastic local minima finding algorithms by a factor of Oe(ℰ-1/6). Experiments on two nonconvex optimization problems demonstrate the effectiveness of our algorithm and corroborate our theory.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2018

Volume

2018-December

Start / End Page

4525 / 4535

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Yu, Y., Xu, P., & Gu, Q. (2018). Third-order smoothness helps: Faster stochastic optimization algorithms for finding local minima. In Advances in Neural Information Processing Systems (Vol. 2018-December, pp. 4525–4535).

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2018

Volume

2018-December

Start / End Page

4525 / 4535

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology