...
首页> 外文期刊>Journal of Engineering & Applied Sciences >Integrated Bisect K-Means and Firefly Algorithm for Hierarchical Text Clustering
【24h】

Integrated Bisect K-Means and Firefly Algorithm for Hierarchical Text Clustering

机译:集成的分级k-means和萤火虫算法进行分级式群集

获取原文
获取原文并翻译 | 示例
           

摘要

Hierarchical text clustering plays a significant role in systematically browsing, summarizing and organizing documents into structure manner. However, the Bisect K-means which is a well-known hierarchical clustering algorithm is only able to generate local optimal solutions due to the employment of K-means as part of its process. In this study, we propose to replace the K-means with firefly algorithm, hence producing a Bisect FA for hierarchical clustering. At each level of the proposed Bisect FA, firefly algorithm works to produce the best clusters. For evaluation purposes, we performed experiments on 20 newsgroups dataset that is commonly used in text clustering studies. The results demonstrate that Bisect FA obtains more accurate and compact clustering than Bisect K-means, K-means and C-firefly algorithms. Such a result indicates that the proposed Bisect FA is a competitive algorithm for unsupervised learning.
机译:分层文本群集在系统地浏览,汇总和组织文档中扮演着重要作用。 然而,作为众所周知的分层聚类算法的平分k-isers仅能够产生由于k均值的k-means作为其过程的一部分而产生局部最佳解决方案。 在本研究中,我们建议用萤火虫算法替换K-milit,因此为分层聚类产生了一分的FA。 在所提出的BOST FA的每个级别,Firefly算法都可以生产出最佳集群。 为了评估目的,我们对20个新闻组数据集进行了实验,该数据集通常用于文本聚类研究。 结果表明,二选项FA比双分辨率K-Means,K均值和C-Firefly算法获得更准确和紧凑的聚类。 这样的结果表明,所提出的分发FA是无监督学习的竞争算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号