Hash based optimization for faster access to inverted index

机译：基于哈希的优化，可以更快地访问倒排索引

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Inverted Index is an important data structure in computer science. It is used to create a mapping between a word and the set of documents in which that word appears. Thus, it is used to store documents per word. Currently, the output of inverted indexing is stored haphazardly in a look up table. Hence traversing through the look up table for fetching indexes requires linear search. The time complexity of linear search is O(n) where n is the number of words whose inverted index has been stored. In this paper, a hash based optimization is proposed for storing the output of inverted index which can reduce the searching time complexity to O(1). Since inverted indexes are quite popular in big data applications like search engines, a MapReduce implementation of the proposed technique is also presented which can be easily implemented in a distributed environment.

机译：倒置指数是计算机科学中的重要数据结构。它用于在一个单词和显示该字的文件集之间创建映射。因此，它用于每个单词存储文档。目前，倒置索引的输出随意地存储在查找表中。因此，通过查找表来获取索引需要线性搜索。线性搜索的时间复杂性是O（n），其中n是已存储反转索引的单词数。在本文中，提出了一种用于存储反相索引的输出的散列优化，这可以将搜索时间复杂度降低到O（1）。由于倒置索引非常受到搜索引擎的大数据应用中，因此还呈现了所提出的技术的MapReduce实现，其可以在分布式环境中容易地实现。

著录项

来源
《International Conference on Inventive Computation Technologies》|2016年|1-5|共5页
会议地点
作者
Samarth Shah; Aadil Shaikh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Optimization; Indexing; Time complexity; Probes; Big data applications; Search engines;

机译：优化;索引编制;时间复杂度;探测;大数据应用;搜索引擎;

相似文献

外文文献
中文文献
专利

1. Fast document summarization using locality sensitive hashing and memory access efficient node ranking [J] . International Journal of Electrical and Computer Engineering . 2016,第3期

机译：使用位置敏感的哈希和高效的内存访问节点排名对文档进行快速汇总
2. Faster string matching based on hashing and bit-parallelism [J] . Al-Ssulami Abdulrakeeb M., Mathkour Hassan Information Processing Letters . 2017,第Jula期

机译：基于散列和位并行的更快的字符串匹配
3. FAST HASHING FUNCTION BASED ON MULTI-PIPELINE HASH CONSTRUCTION (MPHC) [J] . Ola A. Al-wesabi, Azman Samsudin, Nibras Abdullah International Journal of Innovative Computing Information and Control . 2012,第11期

机译：基于多管道哈希构造（MPHC）的快速哈希功能
4. Hash based optimization for faster access to inverted index [C] . Samarth Shah, Aadil Shaikh International Conference on Inventive Computation Technologies . 2016

机译：基于哈希的优化，更快地访问倒置索引
5. G-hash: Towards fast kernel-based similarity search in large graph databases. [D] . Wang, Xiaohong. 2009

机译：G哈希：在大型图形数据库中寻求基于内核的快速相似性搜索。
6. Fast and efficient short read mapping based on a succinct hash index [O] . Haowen Zhang, Yuandong Chan, Kaichao Fan, 2018

机译：基于简洁哈希索引的快速高效的短读映射
7. Optimizing Majority-Inverter Graphs With Functional Hashing [O] . Mathias Soeken, Luca Gaetano Amarù, Pierre-Emmanuel Gaillardon, 2016

机译：用功能散列优化多数逆变器图
8. Distributed Kernelized Locality-Sensitive Hashing for Faster Image Based Navigation. [R] . Hutchison, S. A. 2015

机译：基于分布式核心局部敏感哈希的快速图像导航。

Hash based optimization for faster access to inverted index

摘要

著录项

相似文献

相关主题

期刊订阅