
Predictive Dynamic Load Balancing of Parallel Hash-Joins over Heterogeneous Processors in the Presence of Data Skew


Abstract

In this paper, we present new algorithms to balance the computation of parallel hash joins over heterogeneous processors in the presence of data skew and external loads. Heterogeneity in our model consists of disparate computing elements, as well as general purpose computing ensembles that are subject to external loading. Data skew appears as significant nonuniformities in the distribution of attribute values of underlying relations that are involved in a join. We develop cost models and predictive dynamic load balancing protocols to detect imbalance during the computation of a single large join. Our algorithms can account for imbalance due to data skew as well as heterogeneity in the computing environment. Significant performance gains are reported for a wide range of test cases on a prototype implementation of the system.
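
To make the two sources of imbalance concrete, the following is a minimal sketch and not the paper's protocol: the abstract does not describe the authors' cost models or algorithms in enough detail to reproduce them. It assumes integer join keys, a modulo hash, and a fixed relative speed per processor, and it uses a generic greedy heuristic (largest bucket first, placed on the processor with the earliest estimated finish time). The names `bucket_sizes`, `assign_buckets`, and `speeds` are hypothetical.

```python
from collections import Counter


def bucket_sizes(keys, num_buckets):
    """Count how many tuples fall into each hash bucket (modulo hash assumed)."""
    counts = Counter(key % num_buckets for key in keys)
    return [counts.get(b, 0) for b in range(num_buckets)]


def assign_buckets(sizes, speeds):
    """Greedy assignment: take buckets from largest to smallest and give each
    to the processor whose estimated finish time would be lowest.

    The estimated cost of a bucket on a processor is size / speed, which is
    an illustrative stand-in for a real cost model.
    """
    finish = [0.0] * len(speeds)  # estimated finish time per processor
    owner = {}                    # bucket index -> processor index
    for b in sorted(range(len(sizes)), key=lambda i: sizes[i], reverse=True):
        p = min(range(len(speeds)), key=lambda j: finish[j] + sizes[b] / speeds[j])
        finish[p] += sizes[b] / speeds[p]
        owner[b] = p
    return owner, finish


# Data skew: the attribute value 7 accounts for 600 of the 1000 tuples.
keys = [7] * 600 + list(range(400))
sizes = bucket_sizes(keys, num_buckets=8)

# Heterogeneity: processor 3 is assumed to be four times faster than processor 0.
speeds = [1.0, 1.0, 2.0, 4.0]
owner, finish = assign_buckets(sizes, speeds)

print("bucket sizes:", sizes)
print("bucket owners:", owner)
print("estimated finish times:", finish)
```

A static assignment like this is computed once, before the join runs. The approach summarized in the abstract differs in that it detects imbalance during the computation of a single join and also accounts for external load, which this sketch does not model.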
