首页> 外文会议>International Conference on Communication and Computing Systems >A preliminary comparison of machine learning algorithms for online news feature extraction and analysis
【24h】

A preliminary comparison of machine learning algorithms for online news feature extraction and analysis

机译:在线新闻特征提取和分析机器学习算法的初步比较

获取原文

摘要

Information from social network sites, blogs and web pages generates massive amount of unstructured data. As this big data is generating at faster rate, analysis part is a challenge to data analysis. To manage and extract useful information from this type of data is a cumbersome job. In this paper, we discuss various machine learning techniques to process and handle this data. The advantages and drawbacks each method is stated based on various metrics such as scalability, real-time processing and data size supported. K-means clustering, SVM, and Genetic algorithms are applied on data to get performance impact of features. A comparison of these machine learning techniques has been performed in order to obtain a reasonable algorithm for detection of useful information from online updating resources such as news and tweets. The best result has been achieved by a k-means clustering.
机译:来自社交网站,博客和网页的信息生成大量的非结构化数据。由于这种大数据以更快的速率产生,分析部分是对数据分析的挑战。从这种类型的数据管理和提取有用的信息是一个麻烦的作业。在本文中,我们讨论了各种机器学习技术来处理和处理此数据。基于各种度量来说明各种方法的优点和缺点,例如可伸缩性,实时处理和支持的数据大小。 K-Means Clustering,SVM和遗传算法应用于数据以获得功能的性能影响。已经执行了这些机器学习技术的比较,以便获得来自在线更新资源(例如新闻和推文)的有用信息的合理算法。通过K-Means聚类实现了最佳结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号