IEEE Transactions on Automatic Control

Revisiting Normalized Gradient Descent: Fast Evasion of Saddle Points


Abstract

The paper considers normalized gradient descent (NGD), a natural modification of classical gradient descent (GD) in optimization problems. It is shown that, contrary to GD, NGD escapes saddle points "quickly." A serious shortcoming of GD in nonconvex problems is that it can take arbitrarily long to escape from the neighborhood of a saddle point. In practice, this issue can significantly slow the convergence of GD, particularly in high-dimensional nonconvex problems. The paper focuses on continuous-time dynamics. It is shown that 1) NGD "almost never" converges to saddle points and 2) the time required for NGD to escape from a ball of radius r about a saddle point x* is at most 5√κ·r, where κ is the condition number of the Hessian of f at x*. As a simple application of these results, a global convergence-time bound is established for NGD under mild assumptions.
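As an illustration of the escape behavior the abstract describes, here is a discrete-time sketch (the paper itself analyzes continuous-time dynamics) comparing GD with NGD near the saddle of the toy function f(x, y) = x² − y². The function, step size, iteration count, and starting point are illustrative assumptions, not taken from the paper:

```python
import math

# Toy saddle: f(x, y) = x^2 - y^2, whose only stationary point (the origin)
# is a saddle. The y-axis is the unstable (descent) direction.

def grad(x, y):
    return 2.0 * x, -2.0 * y

def gd(x, y, step=0.01, iters=500):
    # Plain gradient descent: step length shrinks with the gradient norm,
    # so progress near the saddle (where the gradient is tiny) is slow.
    for _ in range(iters):
        gx, gy = grad(x, y)
        x, y = x - step * gx, y - step * gy
    return x, y

def ngd(x, y, step=0.01, iters=500, eps=1e-12):
    # Normalized gradient descent: each step has fixed length `step`,
    # independent of how small the gradient is.
    for _ in range(iters):
        gx, gy = grad(x, y)
        n = math.hypot(gx, gy)
        if n < eps:  # exactly stationary: stop
            break
        x, y = x - step * gx / n, y - step * gy / n
    return x, y

# Start very close to the saddle, slightly off both axes.
x0 = y0 = 1e-8
_, y_gd = gd(x0, y0)
_, y_ngd = ngd(x0, y0)
# GD lingers near the origin because its steps scale with the tiny gradient;
# NGD moves a fixed distance per step along the unstable direction and leaves
# any small ball around the saddle quickly, so |y_ngd| >> |y_gd|.
```

The fixed unit-norm step is exactly what makes the escape time depend only on the ball radius and the Hessian's conditioning, rather than on how close the initial point is to the saddle.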
