基于改进FP-tree的最大频繁项目集挖掘算法

马丽生; 姚光顺; 杨传健

首页> 中文期刊> 《计算机应用》 >基于改进FP-tree的最大频繁项目集挖掘算法

基于改进FP-tree的最大频繁项目集挖掘算法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

针对已有算法为了减少PF-tree中路径被重复遍历的次数,需要保存FP-tree中所有频繁1-项集的条件模式基的问题,对FP-tree的数据结构进行修改,使得只需要保存FP-tree中每个叶子节点的父节点到根节点路径上项目组成的条件模式基,降低了保存条件模式基的存储空间开销.在分析最大频繁项目集挖掘算法中搜索空间以及数据表示方法的基础上,通过理论分析和证明,设计了剪枝策略和压缩策略,缩小了算法搜索空间,压缩了FP-tree的规模,提高了算法的执行效率.最后将新算法分别与NHTFPG算法、FpMAX算法进行对比,验证算法的正确性和有效性.实验结果表明,新算法保存FP-tree条件模式基所需要的存储空间不到NHTFPG算法的50％,执行效率比FpMAX算法提高了2～3倍.%In order to reduce the repeated traversal times of path in the FP-tree, the conditional pattern bases of all frequent 1 -itemsets in the FP-tree need to be saved in the existing algorithms. Concerning this problem, in the new algorithm, the data structure of FP-tree was improved that only the conditional pattern bases were saved which were constituted by the items in the path from every leaf node' parents to the root in the FP-tree, and the storage space of the conditional pattern bases was reduced. After studying search space and the method of data representation in the algorithm for mining maximal frequent itemsets, the pruning and compression strategies were developed through theoretical analysis and verification, which could decrease the search space and the scale of FP-tree. Finally, the new algorithm was compared with NHTFPG algorithm and FpMAX algorithm respectively in terms of accuracy and efficiency. The experimental results show that the new FP-tree algorithm saves the required conditions for model-based storage space more than 50% than NHTFPG algorithm, and the efficiency ratio improves by 2 to 3 times than FpMAX algorithm.

著录项

来源
《计算机应用》 |2012年第2期|326-329|共4页
作者
马丽生; 姚光顺; 杨传健;
展开▼
作者单位

滁州学院计算机与信息工程学院;

安徽滁州239000;

滁州学院计算机与信息工程学院;

安徽滁州239000;

滁州学院计算机与信息工程学院;

安徽滁州239000;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TP311.13;
关键词
频繁项目集; 最大频繁项目集; 条件模式基; 项头表; 剪枝策略; 压缩策略;

相似文献

中文文献
外文文献
专利

1. 基于FP-Tree的最大频繁项目集挖掘算法 [J] . 陈向华 ,刘可昂 . 软件 . 2015,第012期
2. 一种基于FP-tree的最大频繁项目集挖掘算法 [J] . 梅俊 ,郑刚 . 现代计算机（专业版） . 2009,第009期
3. 基于FP-tree的最大频繁项目集挖掘算法 [J] . 马丽生 ,邓辉文 ,齐逸 . 计算机工程与设计 . 2008,第002期
4. 一种基于FP-tree的最大频繁项目集挖掘算法 [J] . 刘乃丽 ,李玉忱 ,马磊 . 计算机应用 . 2005,第005期
5. 基于FP-Tree的最大频繁项目集更新挖掘算法 [J] . 杨君锐 ,赵群礼 . 华中科技大学学报：自然科学版 . 2004,第11期
6. 一种基于FP-tree的最大频繁项目集挖掘算法 [C] . 朱玉全 ,孙志挥 ,宋余庆 . 第十九届全国数据库学术会议 . 2002
7. 基于Apriori和FP-TREE的频繁项目集挖掘算法 [A] . 黄黎明 . 2018

基于改进FP-tree的最大频繁项目集挖掘算法

摘要

著录项

相似文献

相关主题

期刊订阅