...
首页> 外文期刊>Expert Systems with Application >Fault-tolerant tile mining
【24h】

Fault-tolerant tile mining

机译:容错瓷砖开采

获取原文
获取原文并翻译 | 示例
           

摘要

Interesting itemset mining is a fundamental research problem in knowledge management and machine learning. It is intended to identify interesting relations between variables in a database using some measures of interestingness and has a number of applications, including market basket analysis, web usage mining, intrusion detection, and many others. This paper proposes a new interestingness measure, the fault-tolerant tile. That is based on two observations: (1) the length of an itemset can be as important as its frequency; (2) knowledge discovery from real-world datasets calls for fault -tolerant data mining (e.g. extracting fault -tolerant association rules, analyzing noisy datasets). Given a user-defined fault tolerance value, we are interested in finding the maximum/top-k fault-tolerant tiles. Due to the exponential search space of candidate itemsets, both problems are NP-hard. While using some monotonic property to prune search space is a common strategy for interesting itemset mining, no monotonic property is available for this problem. To tackle the challenge, we utilize the branch-and-bound search strategy to analyze the characteristics of candidate itemsets at each searching branch and estimating their bounds. Our experimental results show that our algorithms can effectively analyze real datasets and retrieve meaningful results. (C) 2018 Elsevier Ltd. All rights reserved.
机译:有趣的项目集挖掘是知识管理和机器学习中的一个基本研究问题。它旨在使用一些有趣程度来确定数据库中变量之间的有趣关系,并具有许多应用程序,包括市场篮分析,Web使用挖掘,入侵检测等。本文提出了一种新的有趣措施,容错块。这基于两个观察结果:(1)项目集的长度可能与其频率一样重要; (2)从现实世界的数据集中发现知识需要进行容错数据挖掘(例如,提取容错关联规则,分析嘈杂的数据集)。给定用户定义的容错值,我们有兴趣找到最大/前k个容错区块。由于候选项目集的指数搜索空间,两个问题都是NP-难的。尽管使用一些单调属性来修剪搜索空间是用于有趣项集挖掘的常用策略,但没有单调属性可用于此问题。为了应对这一挑战,我们利用分支和边界搜索策略来分析每个搜索分支处候选项目集的特征并估计其边界。我们的实验结果表明,我们的算法可以有效地分析实际数据集并检索有意义的结果。 (C)2018 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号