d-FuzzStream: A Dispersion-Based Fuzzy Data Stream Clustering

机译：d-FuzzStream：基于分散的模糊数据流聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Fuzzy clustering algorithms have recently been investigated as appropriate techniques to extract knowledge from Data Streams due to their unsupervised nature and flexibility to deal with changes in the distribution of data. While most fuzzy clustering algorithms for Data Streams are based on chunks, the FuzzStream algorithm, proposed before by the authors of this paper, pioneered a fuzzy extension of a different approach known as the Online-Offline Framework (OOF). The extended framework, named Fuzzy Online-Offline Framework (FOOF), includes two steps known as fuzzy abstraction and fuzzy clustering. The fuzzy abstraction step continuously summarizes data in a set of cluster features called Fuzzy Micro Cluster (FMiC). Then, these FMiCs are later clustered in the fuzzy clustering step to generate the data partition. Although FuzzStream has shown to be more robust than other OOF-based algorithms, the fuzzy abstraction process in the algorithm overly reduces the data summarization, almost producing one FMiC for each example, also suffering from high overlapping FMiCs. Furthermore, the algorithm has a long processing time due to its need to calculate membership matrices for every example. In this paper we propose the d-FuzzStream algorithm, an adaptation of FuzzStream using the concepts of fuzzy dispersion and fuzzy similarity in order to improve the data summarization while minimizing the complexity of the algorithm. Experiments showed that the proposed algorithm generates FMiCs with higher representativeness and lower execution time than its original version, still producing similar clustering results.

机译：由于模糊聚类算法的不受监督的性质和处理数据分布变化的灵活性，最近已经研究了模糊聚类算法作为从数据流中提取知识的适当技术。尽管大多数用于数据流的模糊聚类算法都是基于块的，但本文作者之前提出的FuzzStream算法却开创了另一种方法的模糊扩展，称为在线-离线框架（OOF）。扩展框架名为模糊在线-离线框架（FOOF），包括两个步骤，称为模糊抽象和模糊聚类。模糊抽象步骤连续汇总称为模糊微簇（FMiC）的一组簇特征中的数据。然后，这些FMiC稍后在模糊聚类步骤中聚类以生成数据分区。尽管FuzzStream已显示出比其他基于OOF的算法更强大，但该算法中的模糊抽象过程过度减少了数据汇总，每个示例几乎产生一个FMiC，同时还存在高度重叠的FMiC。此外，由于该算法需要计算每个示例的隶属度矩阵，因此具有较长的处理时间。在本文中，我们提出了d-FuzzStream算法，它是使用模糊分散和模糊相似性的概念对FuzzStream进行的改编，目的是在最小化算法复杂度的同时改善数据汇总。实验表明，与原始算法相比，该算法生成的FMiC具有更高的代表性和更低的执行时间，仍能产生相似的聚类结果。

著录项

来源
《IEEE International Conference on Fuzzy Systems》|2018年|1-8|共8页
会议地点
作者
Leonardo Schick; Priscilla de Abreu Lopes; Heloisa de Arruda Camargo;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Prototypes; Dispersion; Partitioning algorithms; Complexity theory; Merging; Microwave integrated circuits;

机译：聚类算法;原型;色散;分区算法;复杂性理论;合并;微波集成电路;

相似文献

外文文献
中文文献
专利

1. Clustering right-skewed data stream via Birnbaum-Saunders mixture models: A flexible approach based on fuzzy clustering algorithm [J] . Hashemi Farzane, Naderi Mehrdad, Mashinchi Mashallah Applied Soft Computing . 2019,第期

机译：通过Birnbaum-Saunders混合模型聚类右偏斜数据流：一种基于模糊聚类算法的灵活方法
2. Online one pass clustering of data streams based on growing neural gas and fuzzy inference systems [J] . Mahmoudabadi Ali, Kuchaki Rafsanjani Marjan, Javidi Mohammad Masoud Expert Systems . 2021,第7期

机译：基于生长神经气体和模糊推理系统的数据流的在线一个传递聚类
3. Fuzzy Clustering-Based Adaptive Regression for Drifting Data Streams [J] . Song Yiliao, Lu Jie, Lu Haiyan, IEEE Transactions on Fuzzy Systems . 2020,第3期

机译：基于模糊的基于聚类的漂移数据流的自适应回归
4. d-FuzzStream: A Dispersion-Based Fuzzy Data Stream Clustering [C] . Leonardo Schick, Priscilla de Abreu Lopes, Heloisa de Arruda Camargo IEEE International Conference on Fuzzy Systems . 2018

机译：d-fuzzstream：基于分散的模糊数据流群集
5. Stream-Dashboard: A big data stream clustering framework with applications to social media streams. [D] . Hawwash, Basheer. 2013

机译：Stream-Dashboard：一个大数据流集群框架，其应用程序适用于社交媒体流。
6. Incremental Interval Type-2 Fuzzy Clustering of Data Streams using Single Pass Method [O] . Sana Qaiyum, Izzatdin Aziz, Mohd Hilmi Hasan, 2020

机译：使用单遍方法的数据流增量间隔2型模糊聚类
7. An Ensemble of Adaptive Neuro-Fuzzy Kohonen Networks for Online Data Stream Fuzzy Clustering [O] . Hu, Zhengbing, Bodyanskiy, Yevgeniy V., Tyshchenko, Oleksii K., 2016

机译：用于在线数据的自适应神经模糊Kohonen网络集流模糊聚类
8. Fuzzy Clustering and Superclustering Scheme for Extracting Structure from Data [R] . Smith, J. F. 1996

机译：基于数据提取结构的模糊聚类与超集群方案

d-FuzzStream: A Dispersion-Based Fuzzy Data Stream Clustering

摘要

著录项

相似文献

相关主题

期刊订阅