首页> 外文学位 >On density-based and representative-based spatial clustering algorithms.
【24h】

On density-based and representative-based spatial clustering algorithms.

机译:基于密度和基于代表的空间聚类算法。

获取原文
获取原文并翻译 | 示例

摘要

Finding interesting patterns in spatial data sets is essential for many applications. Spatial data sets have unique characteristics, for instance autocorrelation, the continuous nature of space, complex spatial data types, the importance of maps as summaries, and the necessity to deal with a large number of potential patterns. However, traditional data mining techniques do not take the unique characteristics of spatial data into consideration; consequently, they do not perform well for applications of this kind. In this study, we approach spatial clustering from two different directions. The first part of the dissertation centers on utilizing supervised density estimation techniques for spatial clustering. Two novel density-based clustering algorithms, DCONTOUR and DENTRAC for clustering spatial point and trajectory data, are introduced and analyzed. Moreover, since DCONTOUR uses polygons as models for spatial clusters, polygon-based spatial post-analysis techniques are proposed which characterize and mine spatial clusters. The second part of the research addresses the need to have an efficient clustering algorithm for large spatial data sets. A representative-based spatial clustering algorithm named CLEVER is parallelized in this study. Different parallel computing paradigms including OpenMP and GPU computing (Nvidia CUDA) are investigated for this purpose. Parallel implementations of CLEVER achieved up to 100 time speed up compared to its sequential counterpart for benchmark data sets ranging between 3,000 and 2,000,000 objects.
机译:在许多应用中,在空间数据集中寻找有趣的模式至关重要。空间数据集具有独特的特征,例如自相关,空间的连续性,复杂的空间数据类型,作为摘要的地图的重要性以及处理大量潜在模式的必要性。但是,传统的数据挖掘技术并未考虑空间数据的独特特征;因此,它们在此类应用程序中表现不佳。在这项研究中,我们从两个不同的方向研究空间聚类。论文的第一部分集中于利用监督密度估计技术进行空间聚类。介绍并分析了两种新颖的基于密度的聚类算法DCONTOUR和DENTRAC,用于对空间点和轨迹数据进行聚类。此外,由于DCONTOUR使用多边形作为空间聚类的模型,因此提出了基于多边形的空间后分析技术,该技术可以表征和挖掘空间聚类。研究的第二部分解决了对大型空间数据集具有有效的聚类算法的需求。在这项研究中并行化了一个基于代表的空间聚类算法CLEVER。为此,研究了包括OpenMP和GPU计算(Nvidia CUDA)在内的不同并行计算范例。与3000到2,000,000对象范围内的基准数据集的CLEVER并行实现相比,CLEVER的并行实现实现了多达100倍的速度加速。

著录项

  • 作者

    Chen, Chun-Sheng.;

  • 作者单位

    University of Houston.;

  • 授予单位 University of Houston.;
  • 学科 Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 127 p.
  • 总页数 127
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号