首页> 外文会议>International Symposium on Chinese Spoken Language Processing >A COMPARATIVE STUDY ON VARIOUS CONFIDENCE MEASURES IN LARGE VOCABULARY SPEECH RECOGNITION
【24h】

A COMPARATIVE STUDY ON VARIOUS CONFIDENCE MEASURES IN LARGE VOCABULARY SPEECH RECOGNITION

机译:大型词汇语音识别中各种信心措施的比较研究

获取原文

摘要

In this paper, we have conducted a comparative study on several confidence measures (CMs) for large vocabulary speech recognition. Firstly, we propose a novel high-level CM that is based on the inter-word mutual information (MI). Secondly, we experimentally investigate several popular low-level CMs, such as word posterior probabilities, N-best counting, Likelihood Ratio Testing (LRT), etc. Finally, we have studied a simple linear interpolation strategy to combine the best low-level CMs with the best high-level CMs. All of these CMs are examined in two large vocabulary ASR tasks, namely the Switchboard task and a mandarin dictation task, to verify the recognition errors in baseline recognition systems. Experimental results show: 1) the proposed Mi-based CMs greatly surpass another existing high-level CMs which are based on the LSA technique; 2) Among all low-level CMs, word posteriori probabilities give the best verification performance; 3) When combining the word posteriori probabilities with the Mi-based CMs, the equal error rate is reduced from 24.4% to 23.9% in the Switchboard task and from 17.5% to 16.2% in the mandarin dictation task.
机译:在本文中,我们对大型词汇表识别的几种置信度量(CMS)进行了比较研究。首先,我们提出了一种基于词交际互联信息(MI)的新型高级CM。其次,我们通过实验研究了几个流行的低级CM,如Word后验概率,N最佳计数,似然比测试(LRT)等。最后,我们研究了一个简单的线性插值策略来组合最佳的低级CMS最佳高级CMS。所有这些CMS都在两个大型词汇ASR任务中检查,即交换机任务和普通话检测任务,以验证基线识别系统中的识别错误。实验结果表明:1)所提出的基于MI的CMS极大地超越了基于LSA技术的现有高级CMS; 2)在所有低级CMS中,Word后验概率提供了最佳的验证性能; 3)与基于MI的CMS相结合的单词后验概率时,交换机任务中的24.4%降低了24.4%至23.9%,普通话检测任务中的17.5%至16.2%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号