首页> 外国专利> Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations

Method and apparatus for improved speech recognition by modifying a pronunciation dictionary based on pattern definitions of alternate word pronunciations

机译:通过基于替代单词发音的模式定义修改发音词典来改进语音识别的方法和设备

摘要

An approach for automatically modifying a pronunciation dictionary in a speech recognition system based on patterns of alternate pronunciations is described. A representation of the pronunciation dictionary, such as a plurality of dynamically linked phoneme values, is obtained. One or more pattern definitions are obtained. The pattern definitions specify zero or more phonemes to be substituted for zero or more phonemes of all words in the pronunciation dictionary. The linked phoneme values are modified by adding, for each path of each word, alternate paths that use each of the substitute phonemes according to the pattern definitions, thereby creating an expanded set of dynamically linked phoneme values. One or more example pronunciations of a particular word are then obtained. One or more best paths through the expanded set of phoneme values are determined for each of the example pronunciations and used to find the overall best path(s). For the overall best path(s), an alternate word pronunciation is constructed by converting each path into a pronunciation using the format of the pronunciation dictionary. The pronunciation dictionary is modified by adding each alternate word pronunciation. As a result, a modified pronunciation dictionary is created that accounts for alternate pronunciations as actually spoken by users of a particular speech recognition application.
机译:描述了一种基于替代发音模式在语音识别系统中自动修改发音词典的方法。获得发音词典的表示,例如多个动态链接的音素值。获得一个或多个模式定义。模式定义指定零个或多个音素来代替发音词典中所有单词的零个或多个音素。通过为每个单词的每个路径添加根据模式定义使用每个替代音素的备用路径来修改链接的音素值,从而创建一组动态链接的音素值。然后获得特定单词的一个或多个示例发音。为每个示例发音确定通过扩展音素值集的一个或多个最佳路径,并将其用于查找总体最佳路径。对于总体最佳路径,通过使用发音词典的格式将每个路径转换为发音来构造备用单词发音。通过添加每个替代单词的发音来修改发音词典。结果,创建了修改的发音词典,其解释了由特定语音识别应用的用户实际说出的替代发音。

著录项

  • 公开/公告号US6389394B1

    专利类型

  • 公开/公告日2002-05-14

    原文格式PDF

  • 申请/专利权人 SPEECHWORKS INTERNATIONAL INC.;

    申请/专利号US20000501341

  • 发明设计人 MARK FANTY;

    申请日2000-02-09

  • 分类号G10L150/00;

  • 国家 US

  • 入库时间 2022-08-22 00:49:23

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号