Determination of the appropriate parameters for K-means clustering using selection of region clusters based on density DBSCAN (SRCD-DBSCAN)

Limwattanapibool Onapa; Arch-int Somjit

首页> 外文期刊>Expert Systems >Determination of the appropriate parameters for K-means clustering using selection of region clusters based on density DBSCAN (SRCD-DBSCAN)

【24h】

Determination of the appropriate parameters for K-means clustering using selection of region clusters based on density DBSCAN (SRCD-DBSCAN)

机译：使用基于密度DBSCAN（SRCD-DBSCAN）的区域聚类选择，确定适合K均值聚类的参数

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

K-means clustering can be highly accurate when the number of clusters and the initial cluster centre are appropriate. An inappropriate determination of the number of clusters or the initial cluster centre decreases the accuracy of K-means clustering. However, determining these values is problematic. To solve these problems, we used density-based spatial clustering of application with noise (DBSCAN) because it does not require a predetermined number of clusters; however, it has some significant drawbacks. Using DBSCAN with high-dimensional data and data with potentially different densities decreases the accuracy to some degree. Therefore, the objective of this research is to improve the efficiency of DBSCAN through a selection of region clusters based on density DBSCAN to automatically find the appropriate number of clusters and initial cluster centres for K-means clustering. In the proposed method, DBSCAN is used to perform clustering and to select the appropriate clusters by considering the density of each cluster. Subsequently, the appropriate region data are chosen from the selected clusters. The experimental results yield the appropriate number of clusters and the appropriate initial cluster centres for K-means clustering. In addition, the results of the selection of region clusters based on density DBSCAN method are more accurate than those obtained by traditional methods, including DBSCAN and K-means and related methods such as Partitioning-based DBSCAN (PDBSCAN) and PDBSCAN by applying the Ant Clustering Algorithm DBSCAN (PACA-DBSCAN).

机译：当聚类的数量和初始聚类中心合适时，K均值聚类可以非常准确。聚类数量或初始聚类中心的不适当确定会降低K均值聚类的准确性。但是，确定这些值是有问题的。为了解决这些问题，我们使用了基于密度的带噪声的应用程序空间聚类（DBSCAN），因为它不需要预定数量的聚类。但是，它有一些明显的缺点。对高维数据和密度可能不同的数据使用DBSCAN会在一定程度上降低准确性。因此，本研究的目的是通过基于密度DBSCAN的区域聚类选择以自动找到合适数量的聚类和初始聚类中心来进行K均值聚类，从而提高DBSCAN的效率。在提出的方法中，DBSCAN用于执行聚类并通过考虑每个聚类的密度来选择适当的聚类。随后，从选定的群集中选择适当的区域数据。实验结果为K均值聚类产生了适当数量的聚类和适当的初始聚类中心。此外，基于密度DBSCAN方法的区域聚类选择结果比传统方法（包括DBSCAN和K-means以及相关方法，例如通过应用Ant的基于分区的DBSCAN（PDBSCAN）和PDBSCAN）获得的结果更准确。聚类算法DBSCAN（PACA-DBSCAN）。

著录项

来源
《Expert Systems》 |2017年第3期|e12204.1-e12204.11|共11页
作者
Limwattanapibool Onapa; Arch-int Somjit;
展开▼
作者单位

Khon Kaen Univ, Fac Sci, Dept Comp Sci, Khon Kaen 40002, Thailand;

Khon Kaen Univ, Fac Sci, Dept Comp Sci, Khon Kaen 40002, Thailand;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
clustering; DBSCAN; density-based clustering; initial cluster centres; K-means; number of clusters;

机译：聚类;DBSCAN;基于密度的聚类;初始聚类中心;K均值;聚类数;

相似文献

外文文献
中文文献
专利

1. Parameter selection algorithm of DBSCAN based on K-means two classification algorithm [J] . Chen Shouhong, Liu Xinyu, Ma Jun, . 2019,第23期

机译：基于k均值的DBSCAN参数选择算法两种分类算法
2. DIC-DOC-K-means: Dissimilarity-based Initial Centroid selection for DOCument clustering using K-means for improving the effectiveness of text document clustering [J] . Lakshmi R., Baskar S. Journal of Information Science . 2019,第6期

机译：DIC-DOC-K-means：使用K-means的DOCument聚类基于不相似性的初始质心选择，以提高文本文档聚类的效率
3. Multiple Parameter Based Clustering (MPC): Prospective Analysis for Effective Clustering in Wireless Sensor Network (WSN) Using K-Means Algorithm [J] . Md. Asif Khan, Israfil Tamim, Emdad Ahmed, Wireless Sensor Network . 2012,第1期

机译：基于多参数的聚类（MPC）：使用K均值算法的无线传感器网络（WSN）中有效聚类的前瞻性分析
4. An improved DBSCAN, a density based clustering algorithm with parameter selection for high dimensional data sets [C] . Shah Glory H. Nirma University International Conference on Engineering . 2012

机译：改进的DBSCAN，一种基于密度的聚类算法，具有针对高维数据集的参数选择
5. K-means clustering with automatic determination of K using a Multiobjective Genetic Algorithm with applications to microarray gene expression data. [D] . Shaw, Matthew Karl Ellis. 2015

机译：使用多目标遗传算法自动确定K值的K均值聚类，并应用于微阵列基因表达数据。
6. Community Detection Method Based on Node Density Degree Centrality and K-Means Clustering in Complex Network [O] . Biao Cai, Lina Zeng, Yanpeng Wang, 2019

机译：基于节点密度程度中心的社区检测方法以及复杂网络中的k均值聚类
7. ADBSCAN: Adaptive Density-Based Spatial Clustering of Applications with Noise for Identifying Clusters with Varying Densities [O] . Mohammad Mahmudur Rahman Khan, Md. Abu Bakr Siddique, Rezoana Bente Arif, 2018

机译：ADBSCAN：基于自适应的基于密度的空间聚类，用于识别具有不同密度的簇的噪声

Determination of the appropriate parameters for K-means clustering using selection of region clusters based on density DBSCAN (SRCD-DBSCAN)

摘要

著录项

相似文献

相关主题

期刊订阅