A new efficient approach for mining uncertain frequent patterns using minimum data structure without false positives

Gangin Lee; Unil Yun


Abstract

Uncertain pattern mining was recently proposed to meet the demand for processing databases that contain uncertain data, and various methods have been devised. However, previous approaches have the following limitations. State-of-the-art tree-based methods can suffer severe runtime and memory problems, depending on the characteristics of the uncertain database and the threshold setting, because their tree data structures can become excessively large and complicated during mining. Various approximation approaches have been suggested to overcome these problems; however, they improve mining performance at the cost of the accuracy of the mining results. To solve these problems, we propose an exact, efficient algorithm for mining uncertain frequent patterns based on novel data structures and mining techniques, which guarantees the correctness of the mining results without any false positives. The newly proposed list-based data structures and pruning techniques allow the complete set of uncertain frequent patterns to be mined more efficiently without pattern losses. We also demonstrate that the proposed algorithm outperforms previous state-of-the-art approaches in both theoretical and empirical aspects. In particular, we provide performance evaluation results on various types of datasets to show the efficiency of our method in terms of runtime, memory usage, and scalability.
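For readers unfamiliar with the setting, the following is a minimal sketch of list-based, exact mining under the standard expected-support model, in which each item in a transaction carries an existential probability and items are assumed independent. It is not the algorithm proposed in the paper: the names `build_item_lists`, `join`, and `mine`, the toy database, and the threshold are all hypothetical and chosen only for illustration. Because expected support never increases when a pattern is extended, pruning on it loses no patterns and reports no false positives.

```python
from typing import Dict, List, Tuple

# Assumption: a transaction maps each item to its existential probability.
Transaction = Dict[str, float]
# Assumption: a "list" maps a transaction id to the probability that the
# pattern occurs in that transaction (items treated as independent).
ProbList = Dict[int, float]


def build_item_lists(db: List[Transaction]) -> Dict[str, ProbList]:
    """Scan the database once and build one probability list per item."""
    lists: Dict[str, ProbList] = {}
    for tid, transaction in enumerate(db):
        for item, prob in transaction.items():
            lists.setdefault(item, {})[tid] = prob
    return lists


def join(a: ProbList, b: ProbList) -> ProbList:
    """Intersect two lists on transaction id and multiply probabilities."""
    if len(a) > len(b):
        a, b = b, a  # iterate over the shorter list
    return {tid: p * b[tid] for tid, p in a.items() if tid in b}


def mine(db: List[Transaction], min_exp_sup: float) -> Dict[Tuple[str, ...], float]:
    """Return every pattern whose expected support is at least min_exp_sup."""
    item_lists = build_item_lists(db)
    # Keep only items that are frequent on their own, in a fixed order.
    items = sorted(
        (item for item, pl in item_lists.items() if sum(pl.values()) >= min_exp_sup)
    )
    result: Dict[Tuple[str, ...], float] = {}

    def extend(prefix: Tuple[str, ...], prefix_list: ProbList, start: int) -> None:
        for i in range(start, len(items)):
            new_list = join(prefix_list, item_lists[items[i]])
            exp_sup = sum(new_list.values())
            # Exact pruning: supersets cannot have higher expected support.
            if exp_sup >= min_exp_sup:
                pattern = prefix + (items[i],)
                result[pattern] = exp_sup
                extend(pattern, new_list, i + 1)

    # The empty pattern occurs in every transaction with probability 1.
    root = {tid: 1.0 for tid in range(len(db))}
    extend((), root, 0)
    return result


# Hypothetical toy database with four uncertain transactions.
db = [
    {"a": 0.9, "b": 0.8, "c": 0.5},
    {"a": 0.6, "b": 0.7},
    {"a": 0.5, "c": 0.4},
    {"b": 0.9, "c": 0.3},
]
patterns = mine(db, min_exp_sup=1.0)
for pattern, exp_sup in sorted(patterns.items()):
    print(pattern, round(exp_sup, 2))
```

The sketch keeps only per-pattern lists in memory during the depth-first search rather than a global tree, which is the general motivation the abstract gives for list-based structures; the paper's own structures and pruning techniques may differ substantially.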


Bibliographic details

Source
Future Generation Computer Systems | 2017, Issue 3 | pp. 89-110 | 22 pages
Authors
Gangin Lee; Unil Yun
Author affiliations

Department of Computer Engineering, Sejong University, Seoul, Republic of Korea;

Department of Computer Engineering, Sejong University, Seoul, Republic of Korea;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
Correctness; Data mining; Existential probability; Frequent pattern mining; Uncertain pattern;


