首页> 外文会议>Fuzzy logic and applications >What Can Fuzzy Cluster Analysis Contribute to Clustering of High-Dimensional Data?

【24h】

What Can Fuzzy Cluster Analysis Contribute to Clustering of High-Dimensional Data?

机译：模糊聚类分析对高维数据聚类有何贡献？

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Cluster analysis of high-dimensional data has become of special interest in recent years. The term high-dimensional data can refer to a larger number of attributes-20 or more-as they often occur in database tables. But high-dimensional data can also mean that we have to deal with thousands of attributes as in the context of genomics or proteomics data where thousands of genes or proteins are measured and are considered in some analysis tasks as attributes. A main reason, why cluster analysis of high-dimensional data is different from clustering low-dimensional data, is the concentration of norm phenomenon, which states more or less that the relative differences between distances between randomly distributed points tend to be more and more similar in higher dimensions. On the one hand, fuzzy cluster analysis has been shown to be less sensitive to initialisation than, for instance, the classical k-means algorithm. On the other, standard fuzzy clustering is stronger affected by the concentration of norm phenomenon and tends to fail easily in high dimensions. Here we present a review of why fuzzy clustering has special problems with high-dimensional data and how this can be amended by modifying the fuzzifier concept. We also describe a recently introduced approach based on correlation and an attribute selection fuzzy clustering technique that can be applied when clusters can only be found in lower dimensions.

机译：近年来，对高维数据进行聚类分析已引起特别关注。高维数据一词可以指20个或更多的大量属性，因为它们经常出现在数据库表中。但是高维数据也可能意味着我们必须处理成千上万个属性，例如在基因组学或蛋白质组学数据中，要测量成千上万的基因或蛋白质，并在某些分析任务中将其视为属性。高维数据的聚类分析与低维数据的聚类分析不同的主要原因是规范现象的集中，它或多或少地表明随机分布点之间的距离之间的相对差异趋于越来越相似在更高的尺寸。一方面，已证明模糊聚类分析对初始化的敏感性不如例如经典k均值算法。另一方面，标准模糊聚类受规范现象集中度的影响更大，并且在高维方面容易失败。在这里，我们对模糊聚类为何对高维数据存在特殊问题以及如何通过修改模糊器概念进行修正的问题进行了综述。我们还描述了一种基于相关性和属性选择模糊聚类技术的最新介绍的方法，该方法可以在只能在较低维中找到聚类时应用。

著录项

来源
《Fuzzy logic and applications》|2013年|1-14|共14页
会议地点 Genoa(IT)
作者
Frank Klawonn;
展开▼
作者单位

Bioinformatics Statistics Helmholtz-Centre for Infection ResearchInhoffenstr. 7, D-38124 Braunschweig, Germany,Department of Computer Science Ostfalia University of Applied Sciences Salzdahlumer Str. 46/48, D-38302 Wolfenbuettel, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Cluster Analysis on High-Dimensional Data: A Comparison of Density-based Clustering Algorithms [J] . Aina Musdholifah, Siti Zaiton Mohd Hashim Australian Journal of Basic and Applied Sciences . 2013,第2013期

机译：高维数据的聚类分析：基于密度的聚类算法的比较
2. Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications (Advanced Science, Engineering and Medicine, Vol. 8(9), pp. 749–757 (2016)) [J] . Baghernia Ali, Pavin Hamid, Mirnabibaboli Miresmail, Advanced Science, Engineering and Medicine . 2017,第7期

机译：聚类高维数据流：生物信息学应用中预计集群的子空间聚类调查（高级科学，工程和医学，Vol.8（9），PP。749-757（2016））
3. ERRATUM: Clustering High-Dimensional Data Stream: A Survey on Subspace Clustering, Projected Clustering on Bioinformatics Applications [J] . Ali Baghernia, Hamid Pavin, Miresmail Mirnabibaboli, Advanced Science, Engineering and Medicine . 2017,第7期

机译：erratum：群集高维数据流：生物信息学应用中的子空间聚类调查，投影群集
4. What Can Fuzzy Cluster Analysis Contribute to Clustering of High-Dimensional Data? [C] . Frank Klawonn International Workshop on Fuzzy Logic and Applications . 2013

机译：模糊集群分析有什么贡献对高维数据的聚类？
5. High-Dimensional Data Clustering and Statistical Analysis of Clustering-based Data Summarization Products. [D] . Zhou, Dunke. 2012

机译：高维数据聚类和基于聚类的数据汇总产品的统计分析。
6. Unsupervised Approach Data Analysis Based on Fuzzy Possibilistic Clustering: Application to Medical Image MRI [O] . Nour-Eddine El Harchaoui, Mounir Ait Kerroum, Ahmed Hammouch, 2013

机译：基于模糊可能性聚类的无监督进场数据分析：在医学图像MRI中的应用
7. Analysis of clinical flow cytometric immunophenotyping data by clustering on statistical manifolds: Treating flow cytometry data as high-dimensional objects How to cite this article: Finn WG, Carter KM, Raich R, Stoolman LM, Hero AO. Analysis of clinical flow cytometric immunophenotyping data by clustering on statistical manifolds: Treating flow cytometry data as high-dimensional objects. Cytometry Part B 2009; 76B: 1–7. [O] . Finn, William G., Carter, Kevin M., Raich, Raviv, 2009

机译：通过聚类统计流形分析临床流式细胞免疫表型数据：将流式细胞术数据作为高维物体处理如何引用本文：Finn WG，Carter Km，Raich R，stoolman Lm，Hero aO。通过聚类在统计流形上分析临床流式细胞免疫表型分析数据：将流式细胞术数据作为高维物体处理。细胞计数B部分2009; 76B：1-7。

What Can Fuzzy Cluster Analysis Contribute to Clustering of High-Dimensional Data?

摘要

著录项

相似文献

相关主题

期刊订阅