Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

Fang Xian; Tie Zhixin; Guan Yinan; Rao Shanshan

首页> 外文期刊>Soft computing: A fusion of foundations, methodologies and applications >Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

【24h】

Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

机译：基于电位熵和T分布式随机邻居嵌入的准簇中心聚类算法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

A novel density-based clustering algorithm named QCC is presented recently. Although the algorithm has proved its strong robustness, it is still necessary to manually determine the two input parameters, including the number of neighbors (k) and the similarity threshold value (), which severely limits the promotion of the algorithm. In addition, the QCC does not perform excellently when confronting the datasets with relatively high dimensions. To overcome these defects, firstly, we define a new method for computing local density and introduce the strategy of potential entropy into the original algorithm. Based on this idea, we propose a new QCC clustering algorithm (QCC-PE). QCC-PE can automatically extract optimal value of the parameter k by optimizing potential entropy of data field. By this means, the optimized parameter can be calculated from the datasets objectively rather than the empirical estimation accumulated from a large number of experiments. Then, t-distributed stochastic neighbor embedding (tSNE) is applied to the model of QCC-PE and further brings forward a method based on tSNE (QCC-PE-tSNE), which preprocesses high-dimensional datasets by dimensionality reduction technique. We compare the performance of the proposed algorithms with QCC, DBSCAN, and DP in the synthetic datasets, Olivetti Face Database, and real-world datasets respectively. Experimental results show that our algorithms are feasible and effective and can often outperform the comparisons.

机译：最近介绍了名为QCC的新型基于密度的聚类算法。尽管算法证明了其强大的稳健性，但仍然需要手动确定两个输入参数，包括邻居（k）的数量和相似度阈值（），其严重限制算法促销。此外，在面对具有相对高维度的数据集时，QCC不会出色。为了克服这些缺陷，首先，我们定义了一种用于计算本地密度的新方法，并将潜在熵的策略引入原始算法。基于这个想法，我们提出了一种新的QCC聚类算法（QCC-PE）。通过优化数据字段的潜在熵，QCC-PE可以自动提取参数k的最佳值。通过这种方式，可以客观地从数据集计算优化参数而不是从大量实验中累积的经验估计来计算。然后，将T分布式随机邻居嵌入（TSNE）应用于QCC-PE的模型，并进一步推动基于TSNE（QCC-PE-TSNE）的方法，其通过维度减少技术预处理高维数据集。我们将建议的算法与QCC，DBSCAN和DP中提出的算法分别进行比较分别。实验结果表明，我们的算法是可行且有效的，并且通常可以优于比较。

著录项

来源
《Soft computing: A fusion of foundations, methodologies and applications》 |2019年第14期|共13页
作者
Fang Xian; Tie Zhixin; Guan Yinan; Rao Shanshan;
展开▼
作者单位

Zhejiang Sci Tech Univ Sch Informat Sci &

Technol Hangzhou Zhejiang Peoples R China;

Zhejiang Sci Tech Univ Sch Informat Sci &

Technol Hangzhou Zhejiang Peoples R China;

Zhejiang Sci Tech Univ Sch Informat Sci &

Technol Hangzhou Zhejiang Peoples R China;

Zhejiang Sci Tech Univ Sch Informat Sci &

Technol Hangzhou Zhejiang Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机软件;
关键词
Data clustering; Quasi-cluster centers clustering; Potential entropy; Optimal parameter; t-distributed stochastic neighbor embedding;

机译：数据聚类;准群集中心聚类;潜在的熵;最佳参数;T分布式随机邻居嵌入;

相似文献

外文文献
中文文献
专利

1. Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding [J] . Fang Xian, Tie Zhixin, Guan Yinan, Soft computing: A fusion of foundations, methodologies and applications . 2019,第14期

机译：基于电位熵和T分布式随机邻居嵌入的准簇中心聚类算法
2. Using t-distributed Stochastic Neighbor Embedding (t-SNE) for cluster analysis and spatial zone delineation of groundwater geochemistry data [J] . Liu Honghua, Yang Jing, Ye Ming, Journal of Hydrology . 2021,第1期

机译：采用T分布式随机邻居嵌入（T-SNE）进行集群分析和地下水地球化学数据的空间区域描绘
3. Chemometric Classification of Crude Oils in Complex Petroleum Systems Using t-Distributed Stochastic Neighbor Embedding Machine Learning Algorithm [J] . Tao Keyu, Cao Jian, Wang Yuce, Energy & fuels . 2020,第5期

机译：用T分布随机邻嵌入机学习算法复杂石油系统中原油的化学计量分类
4. t-Distributed stochastic neighbor embedding spectral clustering [C] . Nicoleta Rogovschi, Jun Kitazono, Nistor Grozavu, International Joint Conference on Neural Networks . 2017

机译：t分布随机邻居嵌入频谱聚类
5. K-centers dynamic clustering algorithms and applications. [D] . Xie, Qing Yan. 2013

机译：K中心动态聚类算法和应用程序。
6. Multiscale Distribution Entropy and t-Distributed Stochastic Neighbor Embedding-Based Fault Diagnosis of Rolling Bearings [O] . Deyu Tu, Jinde Zheng, Zhanwei Jiang, 2018

机译：多尺度分布熵和T分布式随机邻居滚动轴承的故障诊断
7. A fully automated spike sorting algorithm using t-distributed neighbor embedding and density based clustering [O] . Mohammad Hossein Nadian, Saeed Karimimehr, Jafar Doostmohammadi, 2018

机译：一种使用T分布式邻居嵌入和基于密度的聚类的全自动尖峰分类算法

Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding

摘要

著录项

相似文献

相关主题

期刊订阅