首页> 外文期刊>American journal of applied sciences >Model for Load Balancing On Processors in Parallel Mining of Frequent Itemsets | Science Publications
【24h】

Model for Load Balancing On Processors in Parallel Mining of Frequent Itemsets | Science Publications

机译:频繁项集并行挖掘中的处理器负载平衡模型科学出版物

获取原文
           

摘要

> The existence of many large transactions distributed databases with high data schemas, the centralized approach for mining association rules in such databases will not be feasible. Some distributed algorithms have been developed [FDM, CD], but none of them have considered the problem of data skews in distributed mining of association rules. The skewness of datasets reduces the workload balancing between processors involved in distributed mining of association rules. It is important to invent an efficient approach for distributed mining of association rules which have the ability to generate homogeneous partitions of the whole data sets; hence the supports of most large item sets are distributed evenly across the processors. We proposed an efficient stratified sampling based partitioned technique, which generate homogeneous partitions on which processors works in parallel and generate their local concepts approximately simultaneously.
机译: >许多具有高数据模式的大型事务分布式数据库的存在,在这种数据库中集中挖掘关联规则的方法将不可行。已经开发了一些分布式算法[FDM,CD],但是没有一个算法考虑关联规则的分布式挖掘中的数据偏斜问题。数据集的偏斜度减少了关联规则的分布式挖掘中涉及的处理器之间的工作负载平衡。重要的是发明一种有效的方法来进行关联规则的分布式挖掘,该规则具有生成整个数据集的同质分区的能力。因此,大多数大型项目集的支持会在处理器之间平均分配。我们提出了一种有效的基于分层采样的分区技术,该技术可生成同类分区,处理器可在这些同类分区上并行工作,并大致同时生成其本地概念。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号