RETRIEVAL METHOD AND SYSTEM BASED ON WORD VECTOR SIMILARITY

Chinese title (machine translation): 基于词向量相似度的检索方法和系统

Abstract

A retrieval method and system based on word vector similarity. The method comprises: performing word vector training on a retrieval library and establishing a training model corresponding to that library (S1); receiving an input retrieval keyword and using the training model to obtain the related words of the keyword and the similarity between each related word and the keyword (S2); searching the retrieval library for matches using the related words and, according to the similarities, computing a matching score between each file in the retrieval library and the related words (S3); and sorting the files in the retrieval library by matching score from high to low and outputting a retrieval result according to the sorted order (S4). Because the model is trained on the retrieval library itself, the method can draw on the lexical characteristics of each library to strengthen related-word retrieval and matching, thereby improving the accuracy and robustness of retrieval.
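Steps S1 to S4 can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the patented implementation: the abstract does not say how word vectors are trained, so simple co-occurrence count vectors stand in for the training model; cosine similarity is assumed as the similarity measure; the keyword itself is treated as a related word with similarity 1.0; and a file's score is assumed to be the sum of similarity times occurrence count over the related words. All function names, the `window` and `top_n` parameters, and the sample corpus are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_word_vectors(docs, window=2):
    """S1 (stand-in): build a co-occurrence count vector for every word.

    Assumption: a real system would likely train dense word vectors;
    counting neighbours within `window` tokens is used here for brevity.
    """
    vectors = defaultdict(Counter)
    for tokens in docs.values():
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return dict(vectors)


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def related_words(model, keyword, top_n=3):
    """S2: return (word, similarity) pairs for the keyword.

    Assumption: the keyword itself is included with similarity 1.0.
    """
    if keyword not in model:
        return [(keyword, 1.0)]
    sims = [(w, cosine(model[keyword], vec)) for w, vec in model.items() if w != keyword]
    sims = [(w, s) for w, s in sims if s > 0]
    sims.sort(key=lambda pair: (-pair[1], pair[0]))
    return [(keyword, 1.0)] + sims[:top_n]


def score_documents(docs, related):
    """S3: assumed scoring rule, sum of similarity * occurrence count."""
    scores = {}
    for name, tokens in docs.items():
        counts = Counter(tokens)
        scores[name] = sum(sim * counts[word] for word, sim in related)
    return scores


def search(docs, keyword, top_n=3):
    """S1 to S4: train, expand the keyword, score files, sort high to low."""
    model = train_word_vectors(docs)
    related = related_words(model, keyword, top_n)
    scores = score_documents(docs, related)
    return sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))


if __name__ == "__main__":
    # Hypothetical retrieval library: file name -> list of tokens.
    library = {
        "a.txt": "the cat sat on the mat".split(),
        "b.txt": "the kitten sat on the mat".split(),
        "c.txt": "stock prices fell sharply today".split(),
    }
    for name, score in search(library, "cat"):
        print(name, round(score, 3))
```

In this toy library, a file that never contains the keyword can still receive a non-zero score through related words learned from the same library, which is the effect the abstract attributes to the method.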

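Bibliographic details

  • Publication number: WO2017107566A1

  • Publication date: 2017-06-29

  • Original document format: PDF

  • Applicant/patentee: GUANGZHOU SHIYUAN ELECTRONICS CO. LTD.

  • Application number: WO2016CN98234

  • Inventor: LI XIAN

  • Filing date: 2016-09-06

  • Classification: G06F17/30

  • Country: WO

  • Database entry time: 2022-08-21 13:30:40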
