首页> 外国专利> An Effective Primal-Dual Stochastic Distributed Strategy for Large-Scale Machine Learning

An Effective Primal-Dual Stochastic Distributed Strategy for Large-Scale Machine Learning

机译:大规模机器学习的一种有效的原-对偶随机分布策略

摘要

#$%^&*AU2019101711A420200213.pdf#####Abstract In this patent, we propose a novel primal-dual stochastic distributed algorithm for convex optimization problem with private set constraints in multi-agent networked system, where overall nodes aim at collectively minimizing the sum of all local objective functions. Motivated by a variety of applications in machine learning problems with large-scale training sets distributed to multiple autonomous nodes, each local objective function is further designed as the average of moderate number of local instantaneous functions. The algorithm mainly comprises four stages including setting related parameters and initial values of variables; computing stochastic gradients; exchanging information; updating variables. The algorithm set forth in the present invention performs update of each node's state by resorting to unbiased stochastic averaging gradients and projection techniques. Specifically, for each node the gradient of one local instantaneous function selected randomly is evaluated and the average of the most recent stochastic gradients is used to approximate the true local gradient at each iteration. Therefore, the algorithm can significantly reduce the evaluation cost of gradients of local objective functions and improve the computation efficiency in modern large-scale information processing problem, especially with high dimension.4/4 2 - EXTRA - DSA S- - The proposed algorithm •.•••• DGD with a constant step size * - - DGD with a diminishing step size -2 - - - - - - - -_0 C) -6 -8 -10' 0 2 4 6 8 10 simulation time (second) Figure 5 (a) g1 (b) 92 Figure 6
机译:#$%^&* AU2019101711A420200213.pdf #####抽象在该专利中,我们提出了一种新颖的凸对偶随机对偶分布算法多主体网络系统中具有私有集约束的优化问题,其中总体节点的目标是使所有局部目标函数的总和最小化。莫蒂大型训练集可解决机器学习问题中的各种应用分布到多个自治节点,进一步设计每个局部目标函数作为中等数量的局部瞬时函数的平均值。该算法主要包括设置相关参数和变量初始值四个阶段。计算随机梯度;交流信息;更新变量。算法本发明中提出的方法通过使用unbi来执行每个节点状态的更新。ased随机平均梯度和投影技术。具体来说,对于每个节点评估随机选择的一个局部瞬时函数的梯度,并求平均值最近的随机梯度中的一个用于逼近每个位置处的真实局部梯度迭代。因此,该算法可以大大降低梯度的评估成本目标函数的求解并提高现代大规模计算的效率信息处理问题,尤其是高维度。4/42-额外-DSAS--提出的算法•。••••DGD,步长恒定*--DGD,步长逐渐减小-2--------_0C)-6-8-10'0 2 4 6 8 10模拟时间(秒)图5(a)g1(b)92图6

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号