China Communications (English edition)

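Application of a Candidate Expansion Algorithm Based on a Weighted Syllable Confusion Matrix to Chinese Large Vocabulary Continuous Speech Recognition (in English)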

         

Abstract

Including more potentially correct words in the candidate sets is important for improving the accuracy of Large Vocabulary Continuous Speech Recognition (LVCSR). A candidate expansion algorithm based on the Weighted Syllable Confusion Matrix (WSCM) is proposed. First, the WSCM is derived from a confusion network. Then, the recognised candidates in the confusion network are used to conjecture the most likely correct words based on the WSCM, after which the conjectured words are combined with the recognised candidates to produce an expanded candidate set. Finally, a model combining mutual information and a trigram language model is used to rerank the candidates. Experiments on Mandarin film data show that, on lightly erroneous utterances, the character correction rate improves by 9.57% over the initial recognition performance.
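The first three steps of the abstract can be illustrated with a minimal sketch. The abstract does not say how the WSCM weights are computed, so the weighting used here (products of slot posteriors, row-normalised) is an assumption. The function names `build_wscm` and `expand_candidates`, the toy confusion network, and the toy syllable-to-word lexicon are likewise hypothetical. The reranking step with mutual information and a trigram language model is not shown.

```python
from collections import defaultdict


def build_wscm(confusion_network):
    """Build a row-normalised syllable confusion matrix.

    Assumption: two syllables competing in the same slot are treated as
    confusable, weighted by the product of their posteriors.
    """
    counts = defaultdict(lambda: defaultdict(float))
    for slot in confusion_network:
        for syl_a, post_a in slot:
            for syl_b, post_b in slot:
                if syl_a != syl_b:
                    counts[syl_a][syl_b] += post_a * post_b
    wscm = {}
    for syl_a, row in counts.items():
        total = sum(row.values())
        wscm[syl_a] = {syl_b: w / total for syl_b, w in row.items()}
    return wscm


def expand_candidates(slot, wscm, lexicon, top_k=1):
    """Return recognised words plus words conjectured from the WSCM."""
    recognised = [syl for syl, _ in slot]
    expanded = []
    # Keep the words of the originally recognised syllables first.
    for syl in recognised:
        for word in lexicon.get(syl, []):
            if word not in expanded:
                expanded.append(word)
    # Add words for the top_k most confusable syllables of each candidate.
    for syl in recognised:
        row = wscm.get(syl, {})
        for other in sorted(row, key=row.get, reverse=True)[:top_k]:
            for word in lexicon.get(other, []):
                if word not in expanded:
                    expanded.append(word)
    return expanded


# Toy data (hypothetical): each slot lists (syllable, posterior) pairs.
network = [
    [("shi4", 0.7), ("si4", 0.3)],
    [("shi4", 0.6), ("si4", 0.3), ("xi4", 0.1)],
    [("si4", 0.9), ("xi4", 0.1)],
]
lexicon = {"shi4": ["是", "事"], "si4": ["四"], "xi4": ["系"]}

wscm = build_wscm(network)
expanded = expand_candidates([("shi4", 1.0)], wscm, lexicon, top_k=1)
print(expanded)
```

In this sketch a slot that contains only `shi4` still gains the word for `si4`, because the two syllables competed in earlier slots. This is the sense in which expansion adds potentially correct words that the recogniser did not propose for that slot.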
