Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion


Abstract

This paper studies the uncertainty of trained acoustic models and the confusion among them in the context of speech recognition. The goal is to find the most relevant voice features, so the analysis is carried out on a per-feature basis. Model uncertainty is defined as a measure of the overlap between feature distributions. Each model is compared only to the models it is most similar to; accordingly, confusion matrices are built from both feature distributions and recognition results. The voice features are then weighted according to their relevance in order to increase the discrimination among models, with relevance itself deduced from the model uncertainty values. Experimental results show that appropriate weighting improves recognition accuracy, measured as Word Error Rate (WER). Moreover, removing the features with the lowest weights maintains recognition accuracy while significantly reducing the number of calculations.
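
The abstract does not say how distribution overlap is measured or how relevance is turned into weights, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes each model is a diagonal Gaussian per feature, uses the Bhattacharyya coefficient as the overlap measure, compares each model only with its `n_nearest` most similar models, and sets relevance to one minus the average overlap. All function names and parameters here are hypothetical.

```python
import numpy as np


def bhattacharyya_coefficient(mu1, var1, mu2, var2):
    """Overlap in (0, 1] between univariate Gaussians (1 means identical)."""
    avg_var = 0.5 * (var1 + var2)
    distance = (0.125 * (mu1 - mu2) ** 2 / avg_var
                + 0.5 * np.log(avg_var / np.sqrt(var1 * var2)))
    return np.exp(-distance)


def feature_uncertainty(means, variances, n_nearest=2):
    """Average per-feature overlap between each model and its nearest models.

    means, variances: arrays of shape (n_models, n_features).
    Returns an array of shape (n_features,).
    """
    overlap = bhattacharyya_coefficient(
        means[:, None, :], variances[:, None, :],
        means[None, :, :], variances[None, :, :],
    )  # shape (n_models, n_models, n_features)

    # Assumption: two models are "similar" when their mean overlap is high.
    similarity = overlap.mean(axis=2)
    np.fill_diagonal(similarity, -np.inf)  # never compare a model to itself
    nearest = np.argsort(-similarity, axis=1)[:, :n_nearest]

    rows = np.arange(means.shape[0])[:, None]
    nearest_overlap = overlap[rows, nearest, :]  # (n_models, n_nearest, n_features)
    return nearest_overlap.mean(axis=(0, 1))


def feature_weights(uncertainty):
    """Assumed mapping: relevance = 1 - uncertainty, scaled to mean 1."""
    relevance = 1.0 - uncertainty
    total = max(relevance.sum(), np.finfo(float).tiny)
    return relevance * len(relevance) / total


def select_features(weights, keep):
    """Indices of the `keep` highest-weighted features, in ascending order."""
    return np.sort(np.argsort(-weights)[:keep])


# Toy example: feature 0 separates the three models, feature 1 does not.
means = np.array([[0.0, 0.0], [5.0, 0.0], [10.0, 0.0]])
variances = np.ones_like(means)
uncertainty = feature_uncertainty(means, variances)
weights = feature_weights(uncertainty)
kept = select_features(weights, keep=1)
```

In this toy setup the non-discriminative feature gets the lowest weight and is the first to be dropped, which mirrors the weighting-then-pruning idea described in the abstract. In a recognizer, the weights would typically scale each feature's contribution to the acoustic score, but that step depends on the decoder and is not shown.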


Bibliographic details

Source
Conference on Speech Technology and Human-Computer Dialogue | 2011 | 8 pages
Authors
Buzo Andi; Cucu Horia; Burileanu Corneliu; Pasca Miruna; Popescu Vladimir
Original format
PDF
Chinese Library Classification
TN912-53
Keywords
Acoustic Model Uncertainty; Automatic Speech Recognition; Model Confusion


Similar literature


1. Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies [J]. Cui Xiaodong, Zhang Wei, Finkler Ulrich. IEEE Signal Processing Magazine. 2020, No. 3
2. Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition [J]. Ryo MASUMURA, Taichi ASAMI, Takanobu OBA. IEICE Transactions on Information and Systems. 2019, No. 12
3. Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition [J]. Ryo MASUMURA, Taichi ASAMI, Takanobu OBA. IEICE Transactions on Information and Systems. 2018, No. 6
4. Word error rate improvement and complexity reduction in Automatic Speech Recognition by analyzing acoustic model uncertainty and confusion [C]. Buzo Andi, Cucu Horia, Burileanu Corneliu. Proceedings of the 6th International Conference on Speech Technology and Human-Computer Dialogue. 2011
5. Graph-based Semi-Supervised Learning in Acoustic Modeling for Automatic Speech Recognition [D]. Liu, Yuzong. 2016
6. Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features bypassing the phoneme as recognition unit [O]. Denis Arnold, Fabian Tomaschek, Konstantin Sering
7. Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition [O]. Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr. 2021
