可扩展机器学习的并行与分布式优化算法综述

亢良伊; 王建飞; 刘杰; 叶丹

首页> 中文期刊> 《软件学报》 >可扩展机器学习的并行与分布式优化算法综述

可扩展机器学习的并行与分布式优化算法综述

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

机器学习问题通常会转换成一个目标函数去求解,优化算法是求解目标函数中参数的重要工具.在大数据环境下,需要设计并行与分布式的优化算法,通过多核计算和分布式计算技术来加速训练过程.近年来,该领域涌现了大量研究工作,部分算法也在各机器学习平台得到广泛应用,针对梯度下降算法、二阶优化算法、邻近梯度算法、坐标下降算法、交替方向乘子算法这5类最常见的优化方法展开研究,每一类算法分别从单机并行和分布式并行来分析相关研究成果,并从模型特性、输入数据特性、算法评价、并行计算模型等角度对每种算法进行详细对比.随后,对有代表性的可扩展机器学习平台中优化算法的实现和应用情况进行对比分析.同时,对所介绍的所有优化算法进行多层次分类,方便用户根据目标函数类型选择合适的优化算法,也可以通过该多层次分类图交叉探索如何将优化算法应用到新的目标函数类型.最后分析了现有优化算法存在的问题,提出可能的解决思路,并对未来研究方向进行展望.%Machine learning problems can be viewed as optimization-centric programs,and the optimization algorithm is an important tool to solve the objective function.In the era of big data,in order to speed up the training process,it is essential to design parallel and distributed optimization algorithms by multi-core computing and distributed computing technologies.In recent years,there are a lot of research works in this field,and some algorithms have been widely applied on machine learning platforms.In this paper,five common optimization algorithms,including gradient descent algorithm,second order optimization algorithm,proximal gradient algorithm,coordinate descent algorithm and alternating direction method of multiplier,are studied.Each type of algorithm is analyzed from the view of parallel and distributed respectively,and algorithms of the same type are compared by their model type,input data characteristic,algorithm evaluation and parallel communication mode.In addition,the implementations and applications of the optimization algorithm on representative scalable machine learning platforms are analyzed.Meanwhile,all the optimization algorithms introduced in this paper are categorized by a hierarchical classification diagram,which can be used as a tool to select the appropriate optimization algorithm according to the objective function type,and also to cross explore how to apply optimization algorithms to the new objective function type.Finally,the problems of the existing optimization algorithms are discussed,and the possible solutions and the future research directions are proposed.

著录项

来源
《软件学报》 |2018年第1期|109-130|共22页
作者
亢良伊; 王建飞; 刘杰; 叶丹;
展开▼
作者单位

中国科学院软件研究所软件工程技术研发中心;

北京 100190;

中国科学院大学;

北京 100190;

中国科学院软件研究所软件工程技术研发中心;

北京 100190;

中国科学院大学;

北京 100190;

中国科学院软件研究所软件工程技术研发中心;

北京 100190;

计算机科学国家重点实验室(中国科学院软件研究所);

北京 100190;

中国科学院软件研究所软件工程技术研发中心;

北京 100190;

展开▼
原文格式 PDF
正文语种 chi
中图分类自动推理、机器学习;
关键词
机器学习; 优化算法; 并行算法; 分布式算法;

相似文献

中文文献
外文文献
专利

1. 流水行云:支持可扩展的并行分布式流处理系统 [J] . 张鹏 ,刘庆云 ,谭建龙 . 电子学报 . 2015,第004期
2. 分布式计算中可扩展的并行I/O数据分配策略研究 [J] . 曾碧卿 ,陈志刚 ,谭璐 . 小型微型计算机系统 . 2005,第010期
3. 分布式存储环境下并行计算可扩展性的研究与应用 [J] . 陈军 . 计算机工程与科学 . 2001,第4期
4. 面向机器学习的分布式并行计算关键技术及应用 [J] . 曹嵘晖 ,唐卓 ,左知微 . 智能系统学报 . 2021,第005期
5. 并行机器学习算法基础体系前沿进展综述 [J] . 刘斌 ,何进荣 ,耿耀君 . 计算机工程与应用 . 2017,第011期
6. 并行机器学习算法前沿进展综述 [C] . Liu Bin ,刘斌 ,He Jinrong . 2016年全国高性能计算学术年会 . 2016
7. 面向大规模机器学习的分布式优化算法研究 [A] . 梁先锋 . 2021

可扩展机器学习的并行与分布式优化算法综述

摘要

著录项

相似文献

相关主题

期刊订阅