A Highest Sense Count Based Method for Disambiguation of Web Queries for Hindi Language Web Information Retrieval

Sanjay K. Dwivedi

首页> 外文期刊>International journal of information retrieval research >A Highest Sense Count Based Method for Disambiguation of Web Queries for Hindi Language Web Information Retrieval

【24h】

A Highest Sense Count Based Method for Disambiguation of Web Queries for Hindi Language Web Information Retrieval

机译：基于最高感计数的印地语Web信息检索中的Web查询消歧方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The ambiguity in word senses has been recognized as a major challenge for the information retrieval systems. Hindi language web information retrieval, like other languages, faces the problem of sense ambiguity. The sense ambiguity problem deteriorates the performance of every natural language processing (NLP) application. The performance of Hindi language web information retrieval is also affected by it. In this paper, the author formalized an approach for the disambiguation of the senses to improve the performance of Hindi web information retrieval. Our system works in such a way that ambiguity detection has been performed before disambiguation of web queries. Test samples of 100 queries have been selected. When these queries were subjected to ambiguity detection, we found that 43% of them have been detected unambiguous. After ambiguity detection, the disambiguation approach is followed which is based on HSC (Highest Sense Count). Query disambiguation approach further follows query expansion. The expanded query generates the new result set which results into high precision and high similarity score. The 57 expanded queries are tested against 1000 test document instances. The overall improvement is 45% in the average precision, 23% in interpolated average precision and a significant improvement in the average similarity score of the new generated result set. The overall accuracy of our approach has been 61.4% and it improves the performance of the system by 45%.

机译：词义上的歧义已被认为是信息检索系统的主要挑战。像其他语言一样，印地语网络信息检索也面临着含糊不清的问题。含糊不清的问题恶化了每个自然语言处理（NLP）应用程序的性能。印地语网络信息检索的性能也受其影响。在本文中，作者正式提出了一种消除歧义的方法，以提高印地语网络信息检索的性能。我们的系统以这样的方式工作：在对Web查询进行歧义消除之前已经执行了歧义检测。已选择100个查询的测试样本。对这些查询进行歧义检测后，我们发现其中有43％的歧义被检测到。在进行歧义检测之后，遵循基于HSC（最高感知计数）的歧义消除方法。查询消歧方法进一步遵循查询扩展。扩展的查询生成新的结果集，该结果集导致高精度和高相似性得分。针对1000个测试文档实例测试了57个扩展查询。总体改进后的平均精度为45％，插值平均精度为23％，新生成的结果集的平均相似性得分显着提高。我们的方法的总体准确性为61.4％，它将系统的性能提高了45％。

著录项

来源
《International journal of information retrieval research》 |2012年第4期|1-11|共11页
作者
Sanjay K. Dwivedi;
展开▼
作者单位

Department of Computer Science, Babasaheb Bhimrao Ambedkar University, Lucknaw, Uttar Pradesh, India;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Highest Sense Count (HSC); Hindi Language; Query Expansion; Web Information Retrieval; Word Sense Disambiguation;

机译：最高感官计数（HSC）;印地语查询扩展;Web信息检索;词义消歧;

相似文献

外文文献
中文文献
专利

1. An Entropy Based Method for Removing Web Query Ambiguity in Hindi Language | Science Publications [J] . P. Rastogi, S. K. Dwivedi Journal of computer sciences . 2008,第9期

机译：印地语的基于熵的Web查询歧义消除方法科学出版物
2. Performance Evaluation of Different Similarity Functions and Classification Methods Using Web Based Hindi Language Question Answering System [J] . Rajni Devi, Mohit Dua Procedia Computer Science . 2016,第1期

机译：基于Web的印地语语言问答系统对不同相似度函数和分类方法的性能评估
3. A smart web query method for semantic retrieval of web data [J] . Roger H.L. Chiang, Cecil Eng Huang Chua, Veda C. Storey Computers & Structures . 2001,第12期

机译：一种用于Web数据语义检索的智能Web查询方法
4. Query Disambiguation for Cross-Language Information Retrieval Using Web Directories [C] . Kimura, F., Maeda, . 2005

机译：使用Web目录进行查询查询歧义以进行跨语言信息检索
5. Long query as an effective method for improving the quality of information retrieval on the Web. [D] . Taksa, Isak. 2002

机译：长查询是提高Web上信息检索质量的有效方法。
6. A Web Search Method Based on the Temporal Relation of Query Keywords [O] . Tomoyo Kage, Kazutoshi Sumiya -1

机译：基于查询关键词时间关系的Web搜索方法
7. An Entropy Based Method for Removing Web Query Ambiguity in Hindi Language [O] . S. K. Dwivedi, Parul Rastogi 2010

机译：基于熵的北印度语Web查询歧义消除方法

A Highest Sense Count Based Method for Disambiguation of Web Queries for Hindi Language Web Information Retrieval

摘要

著录项

相似文献

相关主题

期刊订阅