...
首页> 外文期刊>International Journal of Computer Trends and Technology >A Novel Metric for Measuring Similarity for Document Clustering
【24h】

A Novel Metric for Measuring Similarity for Document Clustering

机译:一种衡量文档聚类相似性的新指标

获取原文
           

摘要

Clustering is one of the data mining techniques which hasimportant utility in real time applications. Cluster is a group of objects with highest similarity. The clustering results can be used further in many applications including query processing. Clustering can also be used in text mining. The existing clustering algorithms in this domain use single viewpoint to find the similarity between object. However, the singe view point similarity measure cannot have highly informative assessment of similarity. In this paper we propose and implement a novel measure known as multiviewpoint based similarity measure. It will consider multipleviewpoints while measuring similarity which facilitates highly informative assessment of similarity. We built a prototype application to demonstrate the proof of concept. The empirical results revealed that the measure is effective.
机译:聚类是在实时应用中具有重要效用的数据挖掘技术之一。聚类是一组具有最高相似性的对象。聚类结果可以在包括查询处理在内的许多应用程序中进一步使用。聚类也可以用于文本挖掘。该领域中现有的聚类算法使用单个视点来查找对象之间的相似性。但是,单一观点相似性度量不能对相似性进行高度有益的评估。在本文中,我们提出并实现了一种称为基于多视点的相似性度量的新颖度量。在测量相似性时,它将考虑多个观点,这有助于对相似性进行高度有益的评估。我们构建了一个原型应用程序来演示概念验证。实证结果表明该措施是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号