首页> 外文期刊>Statistics and computing >Divisive clustering of high dimensional data streams
【24h】

Divisive clustering of high dimensional data streams

机译:高维数据流的分裂聚类

获取原文
获取原文并翻译 | 示例
       

摘要

Clustering streaming data is gaining importance as automatic data acquisition technologies are deployed in diverse applications. We propose a fully incremental projected divisive clustering method for high-dimensional data streams that is motivated by high density clustering. The method is capable of identifying clusters in arbitrary subspaces, estimating the number of clusters, and detecting changes in the data distribution which necessitate a revision of the model. The empirical evaluation of the proposed method on numerous real and simulated datasets shows that it is scalable in dimension and number of clusters, is robust to noisy and irrelevant features, and is capable of handling a variety of types of non-stationarity.
机译:随着在各种应用程序中部署自动数据采集技术,对流数据进行群集变得越来越重要。我们提出了一种以高密度聚类为动力的高维数据流的完全增量投影除法聚类方法。该方法能够识别任意子空间中的聚类,估计聚类的数量,并检测需要修改模型的数据分布中的变化。对大量真实和模拟数据集所提出的方法进行的经验评估表明,该方法在聚类的维度和数量上可扩展,对嘈杂和无关的特征具有鲁棒性,并且能够处理多种类型的非平稳性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号