Experimental Comparisons of Clustering Approaches for Data Representation

Anand Sanjay Kumar; Kumar Suresh

首页> 外文期刊>ACM computing surveys >Experimental Comparisons of Clustering Approaches for Data Representation

【24h】

Experimental Comparisons of Clustering Approaches for Data Representation

机译：Experimental Comparisons of Clustering Approaches for Data Representation

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

Clustering approaches are extensively used by many areas such as IR, Data Integration, Document Classification, Web Mining, Query Processing, and many other domains and disciplines. Nowadays, much literature describes clustering algorithms on multivariate data sets. However, there is limited literature that presented them with exhaustive and extensive theoretical analysis as well as experimental comparisons. This experimental survey paper deals with the basic principle, and techniques used, including important characteristics, application areas, run-time performance, internal, external, and stability validity of cluster quality, etc., on five different data sets of eleven clustering algorithms. This paper analyses how these algorithms behave with five different multivariate data sets in data representation. To answer this question, we compared the efficiency of eleven clustering approaches on five different data sets using three validity metrics-internal, external, and stability and found the optimal score to know the feasible solution of each algorithm. In addition, we have also included four popular and modern clustering algorithms with only their theoretical discussion. Our experimental results for only traditional clustering algorithms showed that different algorithms performed different behavior on different data sets in terms of running time (speed), accuracy and, the size of data set. This study emphasized the need for more adaptive algorithms and a deliberate balance between the running time and accuracy with their theoretical as well as implementation aspects.

著录项

来源
《ACM computing surveys》 |2023年第3期|45.1-45.33|共33页
作者
Anand Sanjay Kumar; Kumar Suresh;
展开▼
作者单位

NSUT;

GGSIP Univ;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种英语
中图分类
关键词
Clustering approach; internal validation; external validation; stability validation; optimal score;

Experimental Comparisons of Clustering Approaches for Data Representation

摘要

著录项

相关主题

期刊订阅