...
首页> 外文期刊>Computer Science & Information Technology >Web Search Engine Based Semantic Similarity Measure Between Words Using Pattern Retrieval Algorithm
【24h】

Web Search Engine Based Semantic Similarity Measure Between Words Using Pattern Retrieval Algorithm

机译:基于网络搜索引擎的语义相似性测量,使用模式检索算法

获取原文
           

摘要

Semantic Similarity measures plays an important role in information retrieval, natural language processing and various tasks on web such as relation extraction, community mining, document clustering, and automatic meta-data extraction. In this paper, we have proposed a Pattern Retrieval Algorithm [PRA] to compute the semantic similarity measure between the words by combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and non- synonymous word-pairs. The proposed approach aims to improve the Correlation values, Precision, Recall, and F-measures, compared to the existing methods. The proposed algorithm outperforms by 89.8 % of correlation value.
机译:语义相似度测量在信息检索,自然语言处理和网上的各种任务中起重要作用,例如关系提取,社区挖掘,文档聚类和自动元数据提取。在本文中,我们提出了一种模式检索算法[PRA]来通过组合页面计数方法和Web代码段方法来计算单词之间的语义相似度测量。四个关联措施用于使用Web搜索引擎在页面计数方法中的单词之间找到语义相似性。我们使用顺序最小优化(SMO)支持向量机(SVM)来查找来自Web代码段方法的页面计数的相似性分数和排名模式的最佳组合。 SVM受过培训,以分类同义词对和非同义词对。与现有方法相比,拟议的方法旨在改善相关价值,精度,召回和F措施。所提出的算法优于89.8%的相关值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号