JMLR: Workshop and Conference Proceedings

Stochastic Difference of Convex Algorithm and its Application to Training Deep Boltzmann Machines

Abstract

Difference of convex functions (DC) programming is an important approach to nonconvex optimization because such structures arise in many fields. Effective optimization methods, called DC algorithms, have been developed in the deterministic optimization literature. In machine learning, many important learning problems, such as training Boltzmann machines (BMs), can be formulated as DC programs. However, no DC-like algorithm with a convergence rate guarantee has been available for stochastic problems, which are the more natural setting for machine learning tasks. In this paper, we propose a stochastic variant of the DC algorithm and establish the computational complexity of converging to a stationary point in several settings. Moreover, we show that our method includes the expectation-maximization (EM) and Monte Carlo EM (MCEM) algorithms as special cases when training BMs. In other words, we extend the EM/MCEM algorithms to more effective methods from the DC viewpoint, with theoretical convergence guarantees. Experimental results indicate that our method performs well for training binary restricted Boltzmann machines and deep Boltzmann machines without pre-training.
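For context, the classical deterministic DC algorithm that such a stochastic variant builds on splits the objective as f(x) = g(x) − h(x) with g and h convex, and iterates x_{k+1} ∈ argmin_y { g(y) − ⟨∇h(x_k), y⟩ }, i.e., it linearizes the subtracted convex part and minimizes the resulting convex surrogate. The sketch below illustrates a stochastic version of this iteration on a toy DC decomposition; the objective, the noisy gradient oracle for h, the step sizes, and the inner solver are illustrative assumptions, not the authors' exact method.

```python
# Minimal sketch of a stochastic DC-algorithm iteration, assuming the
# standard DC setup f(x) = g(x) - h(x) with g, h convex. The toy problem
# and hyperparameters below are assumptions for illustration only.
import numpy as np

rng = np.random.default_rng(0)

# Toy DC decomposition of the nonconvex f(x) = ||x||^4/4 - ||x||^2/2:
#   g(x) = ||x||^4/4 + ||x||^2   (convex)
#   h(x) = (3/2)||x||^2          (convex)
def grad_h(x):
    return 3.0 * x  # exact gradient of h

def stochastic_grad_h(x, noise=0.1):
    # Stochastic first-order oracle for h (stands in for a minibatch estimate).
    return grad_h(x) + noise * rng.standard_normal(x.shape)

def dca_step(x, grad_h_oracle, lr=0.1, n_inner=50):
    """One stochastic DCA step: linearize h at x, then approximately
    minimize the convex surrogate g(y) - <v, y> by gradient descent."""
    v = grad_h_oracle(x)
    y = x.copy()
    for _ in range(n_inner):
        grad_g = (y @ y) * y + 2.0 * y   # gradient of g(y) = ||y||^4/4 + ||y||^2
        y -= lr * (grad_g - v)           # descend on the surrogate
    return y

x = rng.standard_normal(5)
for k in range(30):
    x = dca_step(x, stochastic_grad_h)
# Stationary points satisfy grad g(x) = grad h(x), i.e. ||x|| = 1 here.
print("final ||x||:", np.linalg.norm(x))
```

With the exact oracle grad_h in place of the noisy one, this reduces to the classical DCA; the paper's contribution, per the abstract, is the complexity analysis of the stochastic case and its connection to EM/MCEM for Boltzmann machines.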
