In this paper, we conduct a comparative study of several confidence measures (CMs) for large-vocabulary speech recognition. First, we propose a novel high-level CM based on inter-word mutual information (MI). Second, we experimentally investigate several popular low-level CMs, such as word posterior probabilities, N-best counting, and likelihood ratio testing (LRT). Finally, we study a simple linear interpolation strategy that combines the best low-level CMs with the best high-level CMs. All of these CMs are examined on two large-vocabulary ASR tasks, namely the Switchboard task and a Mandarin dictation task, to verify recognition errors made by the baseline recognition systems. Experimental results show that: 1) the proposed MI-based CMs greatly surpass other existing high-level CMs based on the LSA technique; 2) among all low-level CMs, word posterior probabilities give the best verification performance; and 3) combining word posterior probabilities with the MI-based CMs reduces the equal error rate from 24.4% to 23.9% on the Switchboard task and from 17.5% to 16.2% on the Mandarin dictation task.
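The linear interpolation strategy mentioned above can be sketched as follows. This is a minimal illustration, not the paper's actual implementation: the weight `alpha`, the decision threshold, and the function names are assumptions introduced here for clarity.

```python
def combine_cm(posterior_score: float, mi_score: float, alpha: float = 0.7) -> float:
    """Linearly interpolate a low-level CM (e.g. a word posterior probability)
    with a high-level CM (e.g. an inter-word MI-based score).

    `alpha` is a hypothetical interpolation weight; in practice it would be
    tuned on held-out data to minimize the equal error rate."""
    return alpha * posterior_score + (1.0 - alpha) * mi_score


def verify_word(posterior_score: float, mi_score: float,
                threshold: float = 0.5, alpha: float = 0.7) -> bool:
    """Accept a hypothesized word if its combined confidence exceeds a
    threshold; otherwise flag it as a likely recognition error."""
    return combine_cm(posterior_score, mi_score, alpha) >= threshold
```

Both component scores are assumed to be normalized to a comparable range before interpolation; without such normalization a single weight cannot balance the two measures meaningfully.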