首页> 外文会议>Knowledge Acquisition : Approaches, algorithms and applications >Clustering over Evolving Data Streams Basedon Online Recent-Biased Approximation
【24h】

Clustering over Evolving Data Streams Basedon Online Recent-Biased Approximation

机译:基于在线最近偏差近似的演化数据流聚类

获取原文
获取原文并翻译 | 示例

摘要

A growing number of real world applications deal with multiple evolving data streams. In this paper, a framework for clustering over evolving data streams is proposed taking advantage of recent-biased approximation. In recent-biased approximation, more details are preserved for recent data and fewer coefficients are kept for the whole data stream, which improves the efficiency of clustering and space usability greatly. Our framework consists of two phases. One is an online phase which approximates data streams and maintains the summary statistics incrementally. The other is an offline clustering phase which is able to perform dynamic clustering over data streams on all possible time horizons. As shown in complexity analyses and also validated by our empirical studies, our framework performed efficiently in the data stream environment while producing clustering results of very high quality.
机译:越来越多的现实应用程序处理多个不断发展的数据流。在本文中,提出了利用最近偏逼近法对不断发展的数据流进行聚类的框架。在最近偏近似中,为最新数据保留了更多细节,为整个数据流保留了较少的系数,这极大地提高了聚类效率和空间可用性。我们的框架包括两个阶段。一个是在线阶段,它近似数据流并逐步维护摘要统计信息。另一个是脱机集群阶段,它能够在所有可能的时间范围内对数据流执行动态集群。如复杂性分析所示,并通过我们的经验研究验证,我们的框架在数据流环境中有效执行,同时产生非常高质量的聚类结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号