...
首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Fast Top-K Path-Based Relevance Query on Massive Graphs
【24h】

Fast Top-K Path-Based Relevance Query on Massive Graphs

机译:大规模图上基于Top-K路径的快速相关性查询

获取原文
获取原文并翻译 | 示例
           

摘要

Obtaining the items highly-relevant to a given set of query items is a key task for various applications, such as recommendation and relationship prediction. A family of path-based relevance metrics, which quantify item relevance based on the paths in an item graph, have been shown to be effective in capturing the relevance in many applications. Despite their effectiveness, path-based relevance normally requires time-consuming iterative computation. We propose an approach to obtain the top-k most relevant items for a given query item set quickly. Our approach uses novel score bounds to detect the emergence of the top-k items during the computation. The approach is designed for a distributed environment, which makes it scale for massive graphs having billions of nodes. Our experimental results show that the proposed approach can provide the results up to two order of magnitudes faster than previously proposed approaches and can scale well with both the size of input and the number of machines used in the computation.
机译:获得与给定查询项目集高度相关的项目是各种应用程序(例如推荐和关系预测)的关键任务。已经显示了一系列基于路径的相关性度量,这些度量基于项目图中的路径来量化项目相关性,在许多应用程序中可以有效地捕获相关性。尽管具有有效性,基于路径的相关性通常需要耗时的迭代计算。我们提出了一种快速获取给定查询项集的前k个最相关项的方法。我们的方法使用新颖的分数边界来检测计算过程中前k个项目的出现。该方法是为分布式环境设计的,这使其可以扩展到具有数十亿个节点的海图。我们的实验结果表明,与先前提出的方法相比,所提出的方法可以更快地提供两个数量级的结果,并且可以在输入大小和计算中使用的机器数量上很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号