A Hybrid Approach to Enhance Task Portability of Acoustic Models in Chinese Speech Recognition

机译：增强中文语音识别声学模型任务可移植性的混合方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents our approach to enhance the portability of acoustic models by mitigating the phonetic mismatch arising from a new testing task which is rather different from the training data. The approach is a hybrid one which combines knowledge-based context categorization to generate a context rich set of subword units, and data-driven-based acoustic model clustering on the level of context category. Compared with the conventional approach of only phonetic decision tree based model clustering and unseen model generation, the new approach improved greatly the desired subword coverage for the new testing domain, and achieved an error rate reduction by 10.8% for Chinese character accuracy in the recognition experiments. Together with the effect of the newly adopted basic units of 9 glottal stops, we achieved a total 23.5% error rate reduction in the testing compared to the baseline system.

机译：本文介绍了我们的方法，通过减轻与测试数据完全不同的新测试任务引起的语音不匹配，来增强声学模型的可移植性。该方法是一种混合方法，它结合了基于知识的上下文分类以生成上下文丰富的子词单元集，以及在上下文类别级别上基于数据驱动的声学模型聚类。与仅基于语音决策树的模型聚类和看不见的模型生成的常规方法相比，新方法极大地提高了新测试域所需的子词覆盖率，并在识别实验中将汉字精度的错误率降低了10.8％。加上新采用的9个声门止动的基本单位的效果，与基准系统相比，我们在测试中总共降低了23.5％的错误率。

著录项

来源
《European Conference on Speech Communication and Technology v.3; 20010903-20010907; Aalborg; DK》|2001年|P.1661-1664|共4页
会议地点 Aalborg(DK);Aalborg(DK)
作者
Jin-Song Zhang; Shu-Wu Zhang; Yoshinori Sagisaka; Satoshi Nakamura;
展开▼
作者单位

ATR Spoken Language Translation Research Laboratories 2-2-2 Hikaridai Seika-cho, Soraku-gun, Kyoto 619-0288, Japan;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类传播理论;
关键词

相似文献

外文文献
中文文献
专利

1. A Fast Adaptation Approach for Enhanced Automatic Recognition of Children's Speech with Mismatched Acoustic Models [J] . Shahnawazuddin S., Sinha Rohit Circuits, systems, and signal processing . 2018,第3期

机译：利用不匹配的声学模型增强儿童语音自动识别的快速自适应方法
2. A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition [J] . Yoo Rhee OH, Hong Kook KIM IEICE transactions on information and systems . 2010,第9期

机译：非母语语音识别的混合声学和发音模型自适应方法
3. A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition [J] . Yoo Rhee OH, Hong Kook KIM IEICE Transactions on Information and Systems . 2010,第9期

机译：非母语语音识别的混合声学模型和语音模型自适应方法
4. A Hybrid Approach to Enhance Task Portability of Acoustic Models in Chinese Speech Recognition [C] . Jin-Song Zhang, Shu-Wu Zhang, Yoshinori Sagisaka, European conference on speech communication and technology . 2001

机译：一种杂交方法，提升中国语音识别中声学模型的任务便携性
5. High-performance automatic speech recognition via enhanced front-end analysis and acoustic modeling. [D] . Gu, Liang. 2001

机译：通过增强的前端分析和声学建模实现高性能的自动语音识别。
6. Recognition of Emotions in Mexican Spanish Speech: An Approach Based on Acoustic Modelling of Emotion-Specific Vowels [O] . Santiago-Omar Caballero-Morales 2013

机译：墨西哥西班牙语语音中的情绪识别：一种基于情绪特定元音声学模型的方法
7. Efficient Acoustic Modeling Method for Unsupervised Speech Recognition using Multi-Task Deep Neural Network [O] . Haitao Yao, Maobo An, Ji Xu, 2016

机译：使用多任务深神经网络的无监督语音识别有效的声学建模方法

A Hybrid Approach to Enhance Task Portability of Acoustic Models in Chinese Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅