首页> 美国卫生研究院文献>SpringerPlus >A distributed query execution engine of big attributed graphs
【2h】

A distributed query execution engine of big attributed graphs

机译:大属性图的分布式查询执行引擎

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

A graph is a popular data model that has become pervasively used for modeling structural relationships between objects. In practice, in many real-world graphs, the graph vertices and edges need to be associated with descriptive attributes. Such type of graphs are referred to as attributed graphs. G-SPARQL has been proposed as an expressive language, with a centralized execution engine, for querying attributed graphs. G-SPARQL supports various types of graph querying operations including reachability, pattern matching and shortest path where any G-SPARQL query may include value-based predicates on the descriptive information (attributes) of the graph edges/vertices in addition to the structural predicates. In general, a main limitation of centralized systems is that their vertical scalability is always restricted by the physical limits of computer systems. This article describes the design, implementation in addition to the performance evaluation of DG-SPARQL, a distributed, hybrid and adaptive parallel execution engine of G-SPARQL queries. In this engine, the topology of the graph is distributed over the main memory of the underlying nodes while the graph data are maintained in a relational store which is replicated on the disk of each of the underlying nodes. DG-SPARQL evaluates parts of the query plan via SQL queries which are pushed to the underlying relational stores while other parts of the query plan, as necessary, are evaluated via indexless memory-based graph traversal algorithms. Our experimental evaluation shows the efficiency and the scalability of DG-SPARQL on querying massive attributed graph datasets in addition to its ability to outperform the performance of Apache Giraph, a popular distributed graph processing system, by orders of magnitudes.
机译:图形是一种流行的数据模型,已广泛用于建模对象之间的结构关系。实际上,在许多实际图形中,图形的顶点和边都需要与描述性属性相关联。这种类型的图被称为属性图。 G-SPARQL已被提出为一种具有集中式执行引擎的表达语言,用于查询属性图。 G-SPARQL支持各种类型的图查询操作,包括可达性,模式匹配和最短路径,其中任何G-SPARQL查询除结构谓词外,还可在图边/顶点的描述性信息(属性)上包含基于值的谓词。通常,集中式系统的主要限制是它们的垂直可伸缩性始终受计算机系统的物理限制。本文除了介绍DG-SPARQL的性能评估之外,还介绍了G-SPARQL查询的分布式,混合和自适应并行执行引擎DG-SPARQL的设计,实现。在此引擎中,图的拓扑分布在基础节点的主存储器上,而图数据保留在关系存储中,该关系存储复制在每个基础节点的磁盘上。 DG-SPARQL通过SQL查询评估查询计划的某些部分,这些查询被推送到基础关系存储中,而查询计划的其他部分则根据需要通过基于无索引内存的图遍历算法进行评估。我们的实验评估表明,DG-SPARQL在查询海量归因图数据集方面具有效率和可扩展性,此外,它还具有比流行的分布式图处理系统Apache Giraph的性能好几个数量级的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号