首页> 外国专利> HYBRID PHONEME, DIPHONE, MORPHEME, AND WORD-LEVEL DEEP NEURAL NETWORKS

HYBRID PHONEME, DIPHONE, MORPHEME, AND WORD-LEVEL DEEP NEURAL NETWORKS

机译:混合音素,双音素,语气和单词级深层神经网络

摘要

An approach of hybrid frame, phone, diphone, morpheme, and word-level Deep Neural Networks (DNN) in model training and applications is described. The approach can be applied to many applications. The approach is based on a regular ASR system, which can be based on Gaussian Mixture Models (GMM) or DNN. In the first step, a regular ASR model is trained. All the training data (in the format of features) are aligned with the transcripts in terms of phonemes and words with the timing information. Feature normalization can be applied for these new features. Based on the alignment timing information, new features are formed in terms of phonemes, diphones, morphemes, and up to words. A first pass regular speech recognition is performed, and the result lattice is produced. In the lattice, there is the timing information for each word. A feature is then extracted and sent to the word-level DNN for scoring. If the word is not in the word-level DNN vocabulary, then a forced alignment is performed to get the timing information for each phoneme. Then features from these phonemes, diphones, and morphemes are sent to the corresponding DNNs for training. And these scores are combined to form the word level scores. In this way, the lattice is rescored, and a new recognition result is produced.
机译:描述了在模型训练和应用中混合框架,电话,双音素,语素和词级深度神经网络(DNN)的方法。该方法可以应用于许多应用。该方法基于常规的ASR系统,该系统可以基于高斯混合模型(GMM)或DNN。第一步,训练常规的ASR模型。所有训练数据(以特征格式)都与成绩单保持一致,包括音素和带有计时信息的单词。功能规范化可以应用于这些新功能。根据对齐时间信息,就音素,双音素,词素和最多单词形成新的功能。执行第一遍常规语音识别,并生成结果格。在晶格中,每个单词都有时间信息。然后提取特征并将其发送到单词级DNN进行评分。如果单词不在单词级DNN词汇表中,则执行强制对齐以获得每个音素的定时信息。然后将这些音素,双音素和语素的特征发送到相应的DNN进行训练。然后将这些分数组合起来,形成单词级别分数。以这种方式,对栅格重新打标,并产生新的识别结果。

著录项

  • 公开/公告号US2018047385A1

    专利类型

  • 公开/公告日2018-02-15

    原文格式PDF

  • 申请/专利权人 APPTEK INC.;

    申请/专利号US201715672486

  • 发明设计人 JINTAO JIANG;HASSAN SAWAF;MUDAR YAGHI;

    申请日2017-08-09

  • 分类号G10L15/06;G10L25/30;G10L15/02;G10L15/187;G10L15/16;G10L15/32;

  • 国家 US

  • 入库时间 2022-08-21 13:03:32

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号