面向数据流的一个高效用项集挖掘算法

慕欢欢; 柴玉梅; 王黎明

首页> 中文期刊> 《计算机应用与软件》 >面向数据流的一个高效用项集挖掘算法

面向数据流的一个高效用项集挖掘算法

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years,to carry out high utility itemset mining in data stream has become an important research topic.Existing algorithms produce a large number of candidate itemsets in mining process and this masks it difficult for the users to screen out useful messages among huge sets of candidate patterns.In light of this situation,we present an algorithm for mining high utility itemsets over data stream,namely HUIDE (high utility itemsets over data stream).First,the algorithm proposes an effective measure of utility metrics by comprehensively considering the information characteristics of data;Then,it describes the distribution of data more accurately using a time-based sliding window and constructs a tree structure,called HUI-tree (high utility itemsets tree).Finally,it traverses the constructed tree structure HUI-tree and mines high utility itemsets.Experimental results in artificial and real data stream show that this algorithm reduces the generation of candidate sets and the consumption of time and space by procuring mining results with scanning database only once.This algorithm can effectively mine high utility itemsets over data stream.%近年来，在数据流中进行高效用项集挖掘成为一个重要的研究课题。已存在的算法在挖掘过程中产生大量的候选项集，使用户很难从大量候选模式中筛选出有用的信息。针对这种情况，提出一个数据流高效用项集挖掘算法HUIDE（High-Utility Item-sets Over Data Streams）。算法首先综合考虑数据的信息特征，提出一种有效的效用度量方法。然后采用基于时间的滑动窗口技术更加准确地描述数据分布，构建一种树结构HUI-tree（High Utility Itemsets tree）。最后遍历构建的树结构HUI-tree挖掘高效用项集。在人工和真实数据流上的实验结果表明该算法通过扫描一次数据库获取挖掘结果，减少了候选项集的产生及时间和空间的消耗。该算法在数据流中能够有效地挖掘高效用项集。

著录项

来源
《计算机应用与软件》 |2015年第4期|283-287313|共6页
作者
慕欢欢; 柴玉梅; 王黎明;
展开▼
作者单位

郑州大学信息工程学院河南郑州450052;

郑州大学信息工程学院河南郑州450052;

郑州大学信息工程学院河南郑州450052;

展开▼
原文格式 PDF
正文语种 chi
中图分类自动推理、机器学习;
关键词
高效用; 数据流; 效用度量; 树结构;

相似文献

中文文献
外文文献
专利

1. 减少候选项集的数据流高效用项集挖掘算法 [J] . 茹蓓 ,贺新征 . 计算机应用研究 . 2017,第011期
2. 基于事务型滑动窗口的数据流中高效用项集挖掘算法 [J] . 宋威 ,刘明渊 ,李晋宏 . 南京大学学报：自然科学版 . 2014,第4期
3. 一种面向数据流的频繁项集挖掘算法 [J] . 孟彩霞 . 昆明理工大学学报：理工版 . 2009,第5期
4. 基于DBP的TOp-k高效用项集挖掘算法 [J] . 蒋华 ,路昕宇 ,王慧娇 . 计算机工程与设计 . 2021,第006期
5. 含负项top-k高效用项集挖掘算法 [J] . 孙蕊 ,韩萌 ,张春砚 . 计算机应用 . 2021,第008期
6. 时间敏感数据流上的频繁项集挖掘算法 [C] . LI Hai-Feng ,李海峰 ,ZHANG Ning . 第29届中国数据库学术会议 . 2012
7. 面向数据流的高效用项集挖掘算法研究 [A] . 慕欢欢 . 2014

面向数据流的一个高效用项集挖掘算法

摘要

著录项

相似文献

相关主题

期刊订阅