首页> 外国专利> MORPHEME ANALYSIS DEVICE, MORPHEME ANALYSIS METHOD, MORPHEME ANALYSIS PROGRAM, AND RECORDING MEDIUM WITH COMPUTER PROGRAM RECORDED THEREON

MORPHEME ANALYSIS DEVICE, MORPHEME ANALYSIS METHOD, MORPHEME ANALYSIS PROGRAM, AND RECORDING MEDIUM WITH COMPUTER PROGRAM RECORDED THEREON

机译:形态分析设备,形态分析方法,形态分析程序,以及记录有计算机程序的记录介质

摘要

PPROBLEM TO BE SOLVED: To provide a morpheme analysis device for acquiring a proper morpheme analysis result even when any undefined word exists. PSOLUTION: This morpheme analysis device is provided with a retrieval result requesting means for, when any undefined word which is not stored in a word dictionary storage means 140 exists in a character string, requesting the retrieval result to an internal or external retrieval device 50 on the basis of the undefined word as retrieval conditions; a document vector calculation means for calculating the whole or a portion of the retrieval result as one document; a similarity calculation means for calculating the similarity of the document vector of the undefined word with the document vector of a known word; a similar word specification means for specifying a similar word as a known word corresponding to the document vector whose similarity is high; and an attribute application means for associating a part of speech and costs of the similar word with the undefined word. The division means is configured to divide the input character string into units by using the part of speech and costs associated with the undefined word by the undefined word attribute application means. PCOPYRIGHT: (C)2009,JPO&INPIT
机译:

要解决的问题:提供一种词素分析设备,即使存在任何未定义的单词,该词素分析设备也可以获取适当的词素分析结果。

解决方案:该词素分析装置设有检索结果请求装置,用于当字符串中未存储在单词词典存储装置140中的任何未定义的单词存在时,向内部或外部检索请求检索结果。装置50基于未定义的单词作为检索条件;文件矢量计算装置,用于将全部或部分检索结果计算为一个文件;一种相似度计算装置,用于计算未定义词的文档向量与已知词的文档向量的相似度;相似词指定装置,用于将相似词指定为与相似度高的文档向量相对应的已知词;属性应用装置,用于将相似词的词性和成本与未定义词相关联。所述划分装置被配置为通过使用由所述未定义词属性应用装置与所述未定义词相关联的词性和成本,将输入字符串划分为多个单元。

版权:(C)2009,日本特许厅&INPIT

著录项

  • 公开/公告号JP2008276561A

    专利类型

  • 公开/公告日2008-11-13

    原文格式PDF

  • 申请/专利权人 YAHOO JAPAN CORP;

    申请/专利号JP20070119982

  • 发明设计人 MASUYAMA TAKESHI;MAKINODA SHIGEO;

    申请日2007-04-27

  • 分类号G06F17/27;

  • 国家 JP

  • 入库时间 2022-08-21 19:44:06

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号