...
首页> 外文期刊>Journal of Computers >A New Sub-topic Clustering Method Based on Semi-supervised Learning
【24h】

A New Sub-topic Clustering Method Based on Semi-supervised Learning

机译:基于半监督学习的新子主题聚类方法

获取原文
           

摘要

—Sub-topic clustering is a crucial step in multidocument summarization. The traditional k-means clustering method is not effective for topic clustering because the number of clusters k must be given in advance. This paper describes a new method for sub-topic clustering based on semi-supervised learning: the method firstly partition the set of sentences into disjoint subsets, each of which contained sentences covering exactly one topic, and labels the sentences which have high scores in the topic, then use the method of constrained-k-means to decide the number of topics, and finally get the sub-topic sets by k- Means clustering. This algorithm can dynamically generate the number of k-means clustering, and the experiment result indicates that the accuracy of clustering is improved.
机译:-sub-topate群集是MultiDocument综合化的一个重要步骤。传统的K-means聚类方法对于主题群集无效,因为必须提前给出群集k的数量。本文介绍了一种基于半监督学习的子主题聚类的新方法:该方法首先将该组句子分隔为不相交的子集,每个句子都包含句子恰好一个主题,并标记了具有高分的句子主题,然后使用受约束k-means的方法来决定主题的数量,最后通过k-表示群集获取子主题集。该算法可以动态地生成K-means群集的数量,实验结果表明聚类的准确性得到改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号