International Conference on Artificial Neural Networks

Sign Based Derivative Filtering for Stochastic Gradient Descent



Abstract

We study the performance of stochastic gradient descent (SGD) in deep neural network (DNN) models. We show that during a single training epoch the signs of the partial derivatives of the loss with respect to a single parameter are distributed almost uniformly over the minibatches. We propose an optimization routine in which we maintain a moving-average history of the sign of each derivative. This history is used to classify a new derivative as "exploratory" if it disagrees with the sign of the history, and as "exploiting" if it agrees. Each derivative is weighted according to this classification, providing control over exploration and exploitation. As we demonstrate through a series of experiments, the proposed approach leads to models trained to higher accuracy.
