基于相关分析的多数据流聚类

屠莉; 陈崚; 邹凌君

首页> 中文期刊> 《软件学报》 >基于相关分析的多数据流聚类

基于相关分析的多数据流聚类

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

提出基于相关分析的多数据流聚类算法.该算法将多数据流的原始数据快速压缩成一个统计概要.根据这些统计概要,可以增量式地计算相关系数来衡量数据间的相似度.提出了一种改进的k-平均算法来生成聚类结果.改进的k-平均算法可以动态、实时地调整聚类数目,并及时检测数据流的发展变化.还将算法应用到按照用户要求的聚类问题(COD),使得用户可以在任意的时间区间上查询聚类结果.提出了一种合理的时间片断划分机制,使得用户指定的任意时间区间都可以由这些时间片断组合而成.在模拟和真实数据上的实验结果都表明,该算法比其他方法具有更好的聚类质量、速度和稳定性,能够实时地反映数据流的变化.%This paper proposes a compression scheme which quickly compresses the raw data from multiple streams into a compressed synopsis. The synopsis allows to incrementally reconstruct the correlation coefficients without accessing the raw data. A modified k-means algorithm is developed to generate clustering results and dynamically adjust the number of clusters in real time so as to detect the evolving changes in the data streams. Finally, the framework is extended to support clustering on demand (COD), where a user can query for clustering results over an arbitrary time horizon. A theoretically sound time-segment partitioning scheme is developed so that any demand time horizon can be fulfilled by a combination of those time-segments. Experimental results on synthetic and real data sets show that the algorithm has higher clustering quality, speed and stability than other methods and can detect the evolving changes of the data streams in real time.

著录项

来源
《软件学报》 |2009年第7期|1756-1767|共12页
作者
屠莉; 陈崚; 邹凌君;
展开▼
作者单位

南京航空航天大学;

信息科学与技术学院;

江苏;

南京;

210093;

扬州大学;

计算机科学与工程系;

江苏;

扬州;

225009;

南京大学;

计算机软件新技术国家重点实验室;

江苏;

南京;

210093;

扬州大学;

计算机科学与工程系;

江苏;

扬州;

225009;

展开▼
原文格式 PDF
正文语种 chi
中图分类人工智能理论;
关键词
聚类; 数据流; 相关分析;

相似文献

中文文献
外文文献
专利

1. 基于同步相关性的多数据流聚类在空气质量评价中的应用 [J] . 李飒 ,李艳杰 . 辽宁石油化工大学学报 . 2016,第002期
2. 基于WebService的多数据流聚类研究 [J] . 赵文彬 . 计算机光盘软件与应用 . 2011,第024期
3. 基于谱聚类的多数据流演化事件挖掘 [J] . 杨宁 ,唐常杰 ,王悦 . 软件学报 . 2010,第010期
4. 基于Web Service的多数据流聚类研究 [J] . 邹凌君 ,高开周 . 广西轻工业 . 2009,第011期
5. 基于相关度的多数据流动态聚类算法 [C] . 金燕 ,刘青宝 ,侯东风 . 中国计算机用户协会网络应用分会2008年网络新技术与应用研讨会 . 2008
6. 基于核密度估计理论的多数据流聚类研究 [A] . 谢益煌 . 2006

基于相关分析的多数据流聚类

摘要

著录项

相似文献

相关主题

期刊订阅