Statistical Analysis of Clustering Performances of NMF, Spectral Clustering, and K-means

机译：NMF，谱聚类和K均值聚类性能的统计分析

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nonnegative matrix factorization (NMF), spectral clustering, and k-means are the most used clustering methods in machine learning research. They have been used in many domains including text, image, and cancer clustering. However, there is still a limited number of works that discuss statistical significance of performance differences between these methods. This issue is epecially important in NMF as this method is still very actively researched with a sheer number of new algorithms are published every year, and being able to demonstrate newly proposed algorithms statistically outperform previous ones is certainly desired. In this paper, we present statistical analysis of clustering performance differences between NMF, spectral clustering, and k-means. We use ten NMF algorithms, six spectral clustering algorithms, and one standard k-means algorithm for benchmark. For data, eleven publicly available microarray gene expression datasets with numbers of classes range from two to ten are used. The experimental results show that statistically performance differences between NMF algorithms and the standard k-means algorithm are not significant, and spectral methods surprisingly perform less well than NMF and k-means.

机译：非负矩阵分解（NMF），频谱聚类和k均值是机器学习研究中最常用的聚类方法。它们已用于许多领域，包括文本，图像和癌症聚类。但是，仍然有数量有限的工作讨论这些方法之间的性能差异的统计意义。这个问题在NMF中尤为重要，因为该方法仍处于非常积极的研究之中，每年都会发布大量新算法，并且肯定希望能够以统计学的方式证明新提出的算法优于以前的算法。在本文中，我们对NMF，频谱聚类和k均值之间的聚类性能差异进行了统计分析。我们使用十种NMF算法，六种频谱聚类算法和一种标准的k均值算法进行基准测试。对于数据，使用了11种可公开获得的微阵列基因表达数据集，其类别数范围为2到10。实验结果表明，NMF算法和标准k均值算法之间的统计性能差异不明显，并且光谱方法出人意料地不如NMF和k均值。

著录项

来源
《International Conference on Computer and Information Sciences》|2020年|1-4|共4页
会议地点
作者
Andri Mirzal;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Kernel; Statistical analysis; Single photon emission computed tomography; Approximation algorithms; Gene expression; Extraterrestrial measurements;

机译：聚类算法;内核;统计分析;单光子发射计算机断层扫描;近似算法;基因表达;外星测量;

相似文献

外文文献
中文文献
专利

1. Performance analysis of optimal cluster selection and intrusion detection by hierarchical K-means clustering with hybrid ABC-DT [J] . Josemila Baby Jesuretnam, Jeba James Rose International journal of pervasive computing and communications . 2021,第1期

机译：用Hybrid ABC-DT进行分层K-MERIAL对最佳聚类选择和入侵检测的性能分析
2. Document Summarization Using NMF and Pseudo Relevance Feedback Based on K-Means Clustering [J] . Park Sun, Cha ByungRae, Kim JongWon Computing and informatics . 2016,第3期

机译：基于K均值聚类的NMF和伪相关反馈的文档汇总
3. DOCUMENT SUMMARIZATION USING NMF AND PSEUDO RELEVANCE FEEDBACK BASED ON K-MEANS CLUSTERING [J] . Park Sun, Cha ByungRae, Kim JongWon Computing and informatics . 2016,第3期

机译：基于K均值聚类的NMF和伪相关反馈的文档摘要
4. Statistical Analysis of Clustering Performances of NMF, Spectral Clustering, and K-means: With Gene Selection [C] . Andri Mirzal International Conference on Computer and Information Sciences . 2020

机译：NMF，谱聚类和K-均值聚类性能的统计分析：带有基因选择
5. Performance analysis of EM-MPM and K-means clustering in 3D ultrasound breast image segmentation [D] . Yang, Huanyi 2013

机译：EM-MPM和K-means聚类在3D超声乳腺图像分割中的性能分析
6. Does Determination of Initial Cluster Centroids Improve the Performance of K-Means Clustering Algorithm? Comparison of Three Hybrid Methods by Genetic Algorithm Minimum Spanning Tree and Hierarchical Clustering in an Applied Study [O] . Saeedeh Pourahmad, Atefeh Basirat, Amir Rahimi, 2020

机译：初始簇质心的确定是否提高了K-Means聚类算法的性能？应用研究中遗传算法最小生成树和分层聚类的三种混合方法的比较
7. Students' Academic Performance Analysis by K-Means Clustering for Investigating Students' Health Conditions within Clusters [O] . Yann Ling Goh, Yeh Huann Goh, Chun-Chieh Yip, 2020

机译：学生的学生学习绩效分析通过K-Means聚类来调查集群中的学生健康状况

Statistical Analysis of Clustering Performances of NMF, Spectral Clustering, and K-means

摘要

著录项

相似文献

相关主题

期刊订阅