Entropy

Mutual Information Based Learning Rate Decay for Stochastic Gradient Descent Training of Deep Neural Networks



Abstract

This paper demonstrates a novel approach to training deep neural networks using a Mutual Information (MI)-driven, decaying Learning Rate (LR), Stochastic Gradient Descent (SGD) algorithm. MI between the output of the neural network and the true outcomes is used to adaptively set the LR for the network in every epoch of the training cycle. This idea is extended to layer-wise setting of the LR, as MI naturally provides a layer-wise performance metric. An LR range test determining the operating LR range is also proposed. Experiments compared this approach with popular gradient-based adaptive LR algorithms such as Adam, RMSprop, and LARS. Accuracy and training-time results that are competitive with or better than those alternatives demonstrate the feasibility of the metric and the approach.
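The abstract outlines, but does not fully specify, how MI drives the per-epoch LR. The sketch below illustrates one way such a rule could look in a classification setting: MI between predicted and true labels is estimated from their empirical joint distribution, normalized by the label entropy, and mapped linearly to an LR. The function names (mutual_information, mi_learning_rate), the normalization by H(true), and the linear mapping between lr_min and lr_max are illustrative assumptions, not the paper's exact formulation.

    # Minimal sketch of MI-driven LR decay, assuming a classification task.
    # The rule lr = lr_max - (lr_max - lr_min) * NMI is an illustrative
    # choice, not necessarily the authors' exact update.
    import numpy as np

    def mutual_information(pred: np.ndarray, true: np.ndarray) -> float:
        """Estimate I(pred; true) in nats from empirical label counts."""
        joint = np.zeros((pred.max() + 1, true.max() + 1))
        for p, t in zip(pred, true):
            joint[p, t] += 1
        joint /= joint.sum()
        px = joint.sum(axis=1, keepdims=True)   # marginal over predictions
        py = joint.sum(axis=0, keepdims=True)   # marginal over true labels
        nz = joint > 0                          # avoid log(0) terms
        return float((joint[nz] * np.log(joint[nz] / (px @ py)[nz])).sum())

    def entropy(labels: np.ndarray) -> float:
        """Empirical entropy H(labels) in nats."""
        p = np.bincount(labels) / labels.size
        p = p[p > 0]
        return float(-(p * np.log(p)).sum())

    def mi_learning_rate(pred, true, lr_min=1e-4, lr_max=1e-1):
        """Map normalized MI in [0, 1] to an LR in [lr_min, lr_max]:
        low MI (poor fit) -> large steps; high MI (good fit) -> small steps."""
        nmi = mutual_information(pred, true) / max(entropy(true), 1e-12)
        return lr_max - (lr_max - lr_min) * float(np.clip(nmi, 0.0, 1.0))

    # Usage: at the end of each epoch, evaluate predictions on held-out data
    # and set the next epoch's LR. Here, synthetic labels stand in for a model.
    rng = np.random.default_rng(0)
    true = rng.integers(0, 10, size=512)
    pred = np.where(rng.random(512) < 0.7, true, rng.integers(0, 10, size=512))
    print(f"next-epoch LR: {mi_learning_rate(pred, true):.4f}")

As training improves the fit, MI rises toward H(true) and the LR decays toward lr_min, which matches the decaying-LR behavior the abstract describes; a per-layer variant would apply the same mapping to an MI estimate computed from each layer's representation.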


