首页> 外国专利> Linguistic disambiguation system and method using string-based pattern training to learn to resolve ambiguity sites

Linguistic disambiguation system and method using string-based pattern training to learn to resolve ambiguity sites

机译:使用基于字符串的模式训练来学习解决歧义位点的语言歧义消除系统和方法

摘要

A linguistic disambiguation system and method creates a knowledge base by training on patterns in strings that contain ambiguity sites. The string patterns are described by a set of reduced regular expressions (RREs) or very reduced regular expressions (VRREs). The knowledge base utilizes the RREs or VRREs to resolve ambiguity based upon the strings in which the ambiguity occurs. The system is trained on a training set, such as a properly labeled corpus. Once trained, the system may then apply the knowledge base to raw input strings that contain ambiguity sites. The system uses the RRE- and VRRE-based knowledge base to disambiguate the sites.
机译:语言歧义消除系统和方法通过训练包含歧义位点的字符串中的模式来创建知识库。字符串模式由一组简化的正则表达式(RRE)或非常简化的正则表达式(VRRE)来描述。知识库根据发生歧义的字符串利用RRE或VRRE来解决歧义。该系统在训练集上训练,例如正确标记的语料库。一旦经过培训,系统便可以将知识库应用于包含歧义位点的原始输入字符串。该系统使用基于RRE和VRRE的知识库来消除站点歧义。

著录项

  • 公开/公告号US6947918B2

    专利类型

  • 公开/公告日2005-09-20

    原文格式PDF

  • 申请/专利权人 ERIC D. BRILL;

    申请/专利号US20030629387

  • 发明设计人 ERIC D. BRILL;

    申请日2003-07-29

  • 分类号G06F17/00;G06N5/00;

  • 国家 US

  • 入库时间 2022-08-21 22:20:02

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号