首页> 外文会议>International conference on web information systems engineering >Discovering Functional Dependencies in Vertically Distributed Big Data
【24h】

Discovering Functional Dependencies in Vertically Distributed Big Data

机译:在垂直分布的大数据中发现功能依赖关系

获取原文

摘要

The issue of discovering FDs has received a great deal of attention in the database research community. However, as the problem is exponential in the number of attributes, existing approaches can only be applied on small centralized datasets. It is challenging to discover FDs from big data, especially if data is distributed. We present a new algorithm DFDD for discovering all functional dependencies in parallel in vertically distributed big data following a breadth-first traversal strategy of the attribute lattice that combines efficient pruning. We verify experimentally that our approach can process distributed big datasets and it is scalable with the number of cluster nodes and the size of datasets.
机译:发现FDS的问题在数据库研究界中受到了大量的关注。但是,由于问题在属性的数量中是指数的,因此只能在小集中式数据集上应用现有方法。发现来自大数据的FDS有挑战性,特别是如果数据分布式。我们介绍了一种新的算法DFDD,用于在组合有效修剪的属性格子的广度遍历策略之后在垂直分布的大数据中发现所有功能依赖性。我们通过实验验证,我们的方法可以处理分布式的大数据集,它可以使用群集节点的数量和数据集的大小来缩放。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号