首页> 外文期刊>International journal of data mining, modelling and management >An efficient method for discovery of large item sets
【24h】

An efficient method for discovery of large item sets

机译:发现大型项目集的有效方法

获取原文
获取原文并翻译 | 示例
           

摘要

In today's emerging field of descriptive data mining, association rule mining (ARM) has been proven helpful to describe essential characteristics of data from large databases. Mining frequent item sets is the fundamental task of ARM. Apriori, the most influential traditional ARM algorithm adopts iterative search strategy for frequent item set generation. But, multiple scans of database, candidate item set generation and large load of system's I/O are major abuses which degrade the mining performance of it. Therefore, we proposed a new method for mining frequent item sets which overcomes these shortcomings. It judges the importance of occurrence of an item set by counting present and absent count of an individual item. Performance evaluation with Apriori algorithm shows that proposed method is more efficient as it finds fewer items in frequent item set in 50% less time without backtracking. It also reduces system I/O load by scanning the database only once.
机译:在当今新兴的描述性数据挖掘领域,已证明关联规则挖掘(ARM)有助于描述大型数据库中数据的基本特征。挖掘频繁的项目集是ARM的基本任务。最有影响力的传统ARM算法Apriori采用迭代搜索策略来生成频繁的项目集。但是,对数据库的多次扫描,候选项目集的生成以及系统I / O的大量负载都是主要的滥用行为,这会降低数据库的挖掘性能。因此,我们提出了克服这些缺点的一种新的频繁项目集挖掘方法。它通过对单个项目的当前和不存在进行计数来判断出现某个项目集的重要性。用Apriori算法进行性能评估表明,该方法效率更高,因为它可以在不回溯的情况下以50%的时间在频繁项目集中找到较少的项目。通过仅扫描数据库一次,它还减少了系统I / O负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号