首页> 中文期刊> 《计算机工程与科学》 >基于Hadoop平台的分布式SVM参数寻优

基于Hadoop平台的分布式SVM参数寻优

         

摘要

参数的选择对算法分类与预测的正确率有直接影响.在参数选择中全局网格搜索有着计算可靠、简单、优化效果明显的优势,适合应用于可靠性要求高的工程运算,如在复杂系统的故障诊断中对故障模式识别算法进行参数寻优等.但是,全局网格搜索在寻优过程中耗时过长,仍然是一个制约其使用的问题,尤其对于实时性要求较高的系统.以支持向量机的参数全局寻优问题为例,针对网格搜索寻优时间长的缺点,利用Hadoop平台进行分布式参数寻优,借助HDFS将参数自动划分到计算节点上,并运用MapReduce计算框架建立分布式参数寻优模型,完成模型训练预测及参数优化.实验结果表明,在不降低算法性能的前提下提高了寻优效率.%The classification and prediction accuracy of an algorithm are directly influenced by the choice of parameters,and among the methods of parameter selection,global grid search has obvious advantages,such as reliable and simple calculation,and obvious optimization effect,which are suitable for engineering operations that have high reliability requirement,for instance,parameter optimization of the fault pattern recognition algorithm in fault diagnosis of system.However,the global grid search is timeconsuming in the search process,therefore there is still a constraint on use,especially for the system which has high real-time requirement.Using the global parameter optimization of support vector machine as a case,Hadoop platform is used for distributed parameter optimization in order to overcome the disadvantage of grid search.With HDFS,the parameters can be automatically divided into calculation nodes.We establish the distributed parameter optimization model by using the MapReduce computing framework,then conduct model training and prediction as well as parameter optimization.Experimental results show that the optimization efficiency is improved without reducing algorithm performance.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号