An Enhanced Apriori Algorithm Using Hybrid Data Layout Based on Hadoop for Big Data Processing

Yassir ROCHD; Imad HAFIDI

首页> 外文期刊>International journal of computer science and network security >An Enhanced Apriori Algorithm Using Hybrid Data Layout Based on Hadoop for Big Data Processing

【24h】

An Enhanced Apriori Algorithm Using Hybrid Data Layout Based on Hadoop for Big Data Processing

机译：一种基于Hadoop的混合数据布局改进Apriori算法，用于大数据处理

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Frequent itemset mining is one of the data mining methodes implemeted to find frequent patterns, utilized in prediction, association rule mining, classification, etc. Apriori algorithm is an iterative method , that is used to discover frequent itemsets from transactional dataset. It scans entire dataset in every iteration to come up with the large frequent itemsets of various cardinality, which sounds efficient for small data but not useful for big data. To resolve the problem of treatment dataset in every iteration, we present an algorithm called Hybrid Frequent Itemset Mining on Hadoop ( HFIMH ) which uses the vertical layout of dataset to solve the problem of treatment the dataset in every iteration. Vertical dataset conveys information to discover support of every itemsets, and the idea of set intersection is utilized to compute it. We compare the execution of HFIMH with another Hadoop based implementation of Apriori algorithm for different datasets. Experimental results demonstrate that our approach is better.

机译：频繁项集挖掘是用于发现频繁模式的数据挖掘方法之一，用于预测，关联规则挖掘，分类等。Apriori算法是一种迭代方法，用于从事务数据集中发现频繁项集。它在每次迭代中扫描整个数据集，以提供各种基数的大型频繁项集，这对于小数据来说听起来很有效，但对大数据却没有用。为了解决每次迭代中处理数据集的问题，我们提出了一种称为Hadoop的混合频繁项集挖掘（HFIMH）的算法，该算法使用数据集的垂直布局来解决每次迭代中对数据集的处理问题。垂直数据集传达信息以发现每个项目集的支持，并利用集合相交的思想进行计算。我们将HFIMH的执行与针对不同数据集的Apriori算法的另一种基于Hadoop的实现进行了比较。实验结果表明我们的方法更好。

著录项

来源
《International journal of computer science and network security》 |2018年第6期|共7页
作者
Yassir ROCHD; Imad HAFIDI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Applicability of Apriori Based Association Rules on Medical Data: Identification of Associations on Medical Data/Heart disease Dataset using Apriori Based Algorithm [J] . P. Sambasiva Rao, T. Uma Devi International Journal of Applied Engineering Research . 2017,第20aPta2期

机译：基于APRIORI基于医疗数据的适用性：使用基于APRiori的算法识别医学数据/心脏病数据集的关联
2. Incorporating security and integrity into the miningprocess of hybrid weighted-hashT apriori algorithmusing Hadoop [J] . R. Sumithra, Sujni Paul International journal of data science . 2018,第3期

机译：将安全性和完整性纳入混合加权 - HASHT APRIORI算法Hadoop的挖掘过程中
3. Improved FTWeightedHashT Apriori Algorithm for Big Data using Hadoop-MapReduce Model [J] . Sarem M. Ammar, Fadl M. Ba-Alwi British Journal of Mathematics & Computer Science . 2018,第1期

机译：使用Hadoop-MapReduce模型改进的大数据FTWeightedHashT先验算法
4. Discussion and Improvement of Apriori Algorithm of Data Mining Based on Hadoop Platform [C] . Mengyang Zhao, Bo Tang, Le Yang International Conference on Frontiers of Manufacturing Science and Measuring Technology . 2017

机译：基于Hadoop平台的数据挖掘APRIORI算法的探讨与改进
5. Algorithmic and software system support to accelerate data processing in CPU-GPU hybrid computing environments. [D] . Wang, Kaibo. 2015

机译：算法和软件系统支持可加速CPU-GPU混合计算环境中的数据处理。
6. Apriori Algorithm for the Data Mining of Global Cyberspace Security Issues for Human Participatory Based on Association Rules [O] . Zhi Li, Xuyu Li, Runhua Tang, 2020

机译：基于关联规则的人类参与性全球网络空间安全问题数据挖掘的APRiori算法
7. Discussion and Improvement of Apriori Algorithm of Data Mining Based on Hadoop Platform [O] . Mengyang Zhao, Bo Tang, Le Yang 2017

机译：基于Hadoop平台的数据挖掘APRIORI算法的探讨与改进
8. Data Processing Algorithms for Inferring Stratospheric Gas Concentrations from Balloon-Based Solar Occultation Data [R] . Chang, I. L. 1987

机译：基于气球的太阳掩星数据推断平流层气体浓度的数据处理算法

An Enhanced Apriori Algorithm Using Hybrid Data Layout Based on Hadoop for Big Data Processing

摘要

著录项

相似文献

相关主题

期刊订阅