Incomplete Big Data Distributed Clustering

机译：不完整的大数据分布式聚类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Partially missing or blurring attribute values make data become incomplete during collecting data. Generally we use imputation or discarding method to deal with incomplete data before clustering. In this paper we proposed an a new similarity metrics algorithm based on incomplete information system. First algorithm divided the data set into a complete data set and non complete data set, and then the complete data set was clustered using the affinity propagation clustering algorithm, incomplete data according to the design method of the similarity metric is divided into the corresponding cluster. In order to improve the efficiency of the algorithm, designing the distributed clustering algorithm based on cloud computing technology. Experiment demonstrates the proposed algorithm can cluster the incomplete big data directly and improve the accuracy and effectively.

机译：部分丢失或模糊属性值使数据在收集数据期间变得不完整。一般来说，我们使用归咎或丢弃方法来处理群集之前的不完整数据。本文提出了一种基于不完整信息系统的新的相似度量算法。第一算法将数据集分成完整的数据集和非完整数据集，然后使用关联传播聚类算法群集完整数据集，根据相似度量的设计方法的不完整数据被划分为相应的群集。为了提高算法效率，基于云计算技术设计分布式聚类算法。实验演示了所提出的算法可以直接聚类不完整的大数据并有效地提高准确性。

著录项

来源
《International Conference on Manufacturing Technology and Electronics Applications》|2014年||共4页
会议地点
作者
Yonglin Leng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TH16-53;
关键词
Incomplete big data; AP clustering; Cloud computing;

机译：不完整的大数据;AP聚类;云计算;

相似文献

外文文献
中文文献
专利

1. A Distributed Weighted Possibilistic c-Means Algorithm for Clustering Incomplete Big Sensor Data [J] . QingchenZhang, ZhikuiChen International Journal of Distributed Sensor Networks . 2014,第1期

机译：分布式不完全大传感器数据聚类的加权可能c均值算法
2. Multiple imputation for analysis of incomplete data in distributed health data networks [J] . Changgee Chang, Yi Deng, Xiaoqian Jiang, Nature Communications . 2020,第1期

机译：分布式健康数据网络中的不完整数据分析多重估算
3. Revisiting French tomato data: Cluster analysis with incomplete data [J] . Plaehn Dave Food Quality and Preference . 2019,第期

机译：重新审视法国番茄数据：具有不完整数据的集群分析
4. Incomplete Big Data Distributed Clustering [C] . Yonglin Leng International Conference on Manufacturing Technology and Electronics Applications . 2014

机译：不完整的大数据分布式聚类
5. Distributed Query Processing Over Incomplete, Sampled, and Locality-Aware Data [D] . Sundarmurthy, Bruhathi. 2018

机译：对不完整，采样和位置感知的数据进行分布式查询处理
6. The Optimally Designed Variational Autoencoder Networks for Clustering and Recovery of Incomplete Multimedia Data [O] . Xiulan Yu, Hongyu Li, Zufan Zhang, 2019

机译：针对不完整多媒体数据的聚类和恢复的优化设计的变分自动编码器网络
7. Incomplete Big Data Distributed Clustering [O] . Yonglin Leng 2016

机译：不完整的大数据分布式聚类
8. Distributed Tracking with Consensus on Noisy Time-varying Graphs with Incomplete Data. [R] . Jayaweera, S. K., Ruan, Y., Erwin, R. S. 2010

机译：具有不完全数据的噪声时变图的分布式跟踪共识。

Incomplete Big Data Distributed Clustering

摘要

著录项

相似文献

相关主题

期刊订阅