An Experimental Evaluation of SimRank-based Similarity Search Algorithms

机译：基于SimRank的相似性搜索算法的实验评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Given a graph, SimRank is one of the most popular measures of the similarity between two vertices. We focus on efficiently calculating SimRank, which has been studied intensively over the last decade. This has led to many algorithms that efficiently calculate or approximate SimRank being proposed by researchers. Despite these abundant research efforts, there is no systematic comparison of these algorithms. In this paper, we conduct a study to compare these algorithms to understand their pros and cons. We first introduce a taxonomy for different algorithms that calculate SimRank and classify each algorithm into one of the following three classes, namely, iterative-, non-iterative-, and random walk-based method. We implement ten algorithms published from 2002 to 2015, and compare them using synthetic and real-world graphs. To ensure the fairness of our study, our implementations use the same data structure and execution framework, and we try our best to optimize each of these algorithms. Our study reveal-s that none of these algorithms dominates the others: algorithms based on iterative method often have higher accuracy while algorithms based on random walk can be more scalable. One non-iterative algorithm has good effectiveness and efficiency on graphs with medium size. Thus, depending on the requirements of different applications, the optimal choice of algorithms differs. This paper provides an empirical guideline for making such choices.

机译：给定一个图，SimRank是两个顶点之间相似度最受欢迎的度量之一。我们专注于有效地计算SimRank，在过去十年中对此进行了深入研究。这导致研究人员提出了许多有效计算或近似SimRank的算法。尽管进行了大量的研究工作，但这些算法尚无系统的比较。在本文中，我们进行了一项比较这些算法的研究，以了解它们的优缺点。我们首先介绍用于计算SimRank的不同算法的分类法，并将每种算法分为以下三类之一，即基于迭代，非迭代和基于随机游走的方法。我们实施了从2002年到2015年发布的十种算法，并使用合成图和真实图进行比较。为了确保我们研究的公平性，我们的实现使用相同的数据结构和执行框架，并且我们会尽力优化每种算法。我们的研究揭示了-这些算法中没有一个能在其他算法中占主导地位：基于迭代方法的算法通常具有更高的准确性，而基于随机游走的算法可以具有更高的可扩展性。一种非迭代算法对中等大小的图具有良好的有效性和效率。因此，根据不同应用程序的需求，算法的最佳选择会有所不同。本文提供了进行此类选择的经验指导。

著录项

来源
《International conference on very large data bases》|2017年|601-612|共12页
会议地点
作者
Zhipeng Zhang; Yingxia Shao; Bin Cui; Ce Zhang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. LC-MS/MS Software for Screening Unknown Erectile Dysfunction Drugs and Analogues: Artificial Neural Network Classification, Peak-Count Scoring, Simple Similarity Search, and Hybrid Similarity Search Algorithms [J] . Jang Inae, Lee Jae-ung, Lee Jung-min, Analytical chemistry . 2019,第14期

机译：LC-MS / MS软件用于筛选未知的勃起功能障碍药物和类似物：人工神经网络分类，峰值计数评分，简单的相似性搜索和混合相似性搜索算法
2. FAIR RESOURCE ALLOCATION IN A SIMPLE MULTI-AGENT SETTING: SEARCH ALGORITHMS AND EXPERIMENTAL EVALUATION [J] . PARASKEVI RAFTOPOULOU, MANOLIS KOUBARAKIS, KOSTAS STERGIOU, International Journal of Artificial Intelligence Tools: Architectures, Languages, Algorithms . 2005,第6期

机译：简单的多代理设置中的公平资源分配：搜索算法和实验评估
3. Generating customised experimental stimuli for visual search using Genetic Algorithms shows evidence for a continuum of search efficiency. [J] . Verma M, McOwan PW Vision Research: An International Journal in Visual Science . 2009,第3期

机译：使用遗传算法生成用于视觉搜索的定制实验刺激，可以证明连续的搜索效率。
4. An Experimental Evaluation of SimRank-based Similarity Search Algorithms [C] . Zhipeng Zhang, Yingxia Shao, Bin Cui, International conference on very large data bases . 2017

机译：基于SIMRANK的相似性搜索算法的实验评估
5. New Algorithmic Tools for Distributed Similarity Search and Edge Estimation [D] . Rashtchian, Cyrus. 2018

机译：用于分布式相似性搜索和边缘估计的新算法工具
6. An experimental evaluation of the incidence of fitness-function/search-algorithm combinations on the classification performance of myoelectric control systems with iPCA tuning [O] . Guillermo A Camacho, Carlos H Llanos, Pedro A Berger, 2013

机译：iPCA调整的适应度函数/搜索算法组合对肌电控制系统分类性能的影响的实验评估
7. Generating customised experimental stimuli for visual search using Genetic Algorithms shows evidence for a continuum of search efficiency [O] . Verma Milan, McOwan Peter W. 2009

机译：使用遗传算法生成用于视觉搜索的定制实验刺激，可以证明连续的搜索效率

An Experimental Evaluation of SimRank-based Similarity Search Algorithms

摘要

著录项

相似文献

相关主题

期刊订阅