(α*(J(θ)θ+β*abJ(θ)θ). ]]> ;The process is repeated a next epoch until the set of conditions are met."/> OPTIMIZATION OF MODEL GENERATION IN DEEP LEARNING NEURAL NETWORKS USING SMARTER GRADIENT DESCENT CALIBRATION
首页> 外国专利> OPTIMIZATION OF MODEL GENERATION IN DEEP LEARNING NEURAL NETWORKS USING SMARTER GRADIENT DESCENT CALIBRATION

OPTIMIZATION OF MODEL GENERATION IN DEEP LEARNING NEURAL NETWORKS USING SMARTER GRADIENT DESCENT CALIBRATION

机译:基于更智能梯度下降校正的深度学习神经网络模型生成优化

摘要

In training a new neural network, batches of the new training dataset are generated. An epoch of batches is passed through the new neural network using an initial weight (θ). An area minimized (Ai) under an error function curve and an accuracy for the epoch are calculated. It is then determined whether a set of conditions are met, where the set of conditions includes whether Ai is less than an average area (A_avg) from a training of an existing neural network and whether the accuracy is within a predetermined threshold. When the set of conditions are not met, a new θ is calculated by modifying a dynamic learning rate (β) by an amount proportional to a ratio of Ai/A_avg and by calculating the new θ using the modified β according to; <math overflow="scroll"><mrow><mrow><mo>(</mo><mrow><mrow><mi>α</mi><mo>*</mo><mfrac><mrow><mo>&#x2202;</mo><mrow><mo>(</mo><mrow><mi>J</mi><mo></mo><mrow><mo>(</mo><mi>θ</mi><mo>)</mo></mrow></mrow></mrow></mrow><mrow><mo>&#x2202;</mo><mi>θ</mi></mrow></mfrac></mrow><mo>+</mo><mrow><mi>β</mi><mo>*</mo><mrow><msubsup><mo>∫</mo><mi>a</mi><mi>b</mi></msubsup><mo></mo><mrow><mrow><mi>J</mi><mo></mo><mrow><mo>(</mo><mi>θ</mi><mo>)</mo></mrow></mrow><mo></mo><mrow><mo>&#x2202;</mo><mi>θ</mi></mrow></mrow></mrow></mrow></mrow><mo>)</mo></mrow><mo>.</mo></mrow></math> ;The process is repeated a next epoch until the set of conditions are met.
机译:在训练新的神经网络时,会生成一批新的训练数据集。使用初始权重(θ)通过新的神经网络传递一批时间。计算误差函数曲线下的最小面积(A i )和时期的精度。然后根据现有神经网络的训练确定是否满足一组条件,其中一组条件包括A i 是否小于平均面积(A_avg)。在预定阈值内。当不满足一组条件时,通过将动态学习率(β)修改为与A i / A_avg之比成比例的量,并使用根据 <![CDATA [<数学溢出=“ scroll”> α * J < mo>( θ θ + β * a b J < / mi> θ θ ]]> ;该过程在下一个时期重复进行,直到满足一组条件为止。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号