AUTOMATIC SPEECH RECOGNITION FOR UNDER-RESOURCED LANGUAGES


Abstract

This paper presents our methodology for automatic speech recognition (ASR) in the context of under-resourced languages. We first explain our data collection methodology. We then present two techniques for bootstrapping acoustic models: cross-lingual and grapheme-based acoustic modelling. First, we present the potential of cross-lingual context-independent and context-dependent acoustic modelling for Vietnamese. Experimental results on Vietnamese ASR show that when only a few hours of speech data are available in the target language, cross-lingual context-independent (CI) modelling works better. However, when more speech data are available, cross-lingual CI modelling is outperformed by cross-lingual context-dependent (CD) modelling. We also conclude that, in both cases, cross-lingual systems are better than monolingual baseline systems. We also investigate some techniques for grapheme-based acoustic modelling. To improve the initialization of the graphemic acoustic models, we use a word boundary detector to segment an utterance into words. This technique eliminates some inter-word segmentation errors. Moreover, results obtained on both Vietnamese and Khmer ASR demonstrate the feasibility of the grapheme-based approach. Finally, we present preliminary experiments in statistical language modelling that use subword units to reduce the complexity of the models. The potential of such an approach is shown for dialectal Arabic, for which very little text data is available for training a statistical language model.
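
As a rough illustration of the grapheme-based approach mentioned in the abstract, the sketch below builds a graphemic pronunciation lexicon, in which each word is modelled as the sequence of its written characters rather than a sequence of phonemes. It is a minimal sketch and not the authors' implementation: the function name `graphemic_lexicon`, the use of Unicode NFC normalization, and the choice of one code point per modelling unit are all assumptions made for illustration.

```python
import unicodedata


def graphemic_lexicon(words):
    """Map each word to its list of grapheme units (hypothetical helper).

    Assumption: one Unicode code point after NFC normalization is one
    acoustic modelling unit. This is an illustrative choice only.
    """
    lexicon = {}
    for word in words:
        # NFC composes base letter + diacritics into a single code point
        # where a precomposed form exists (e.g. Vietnamese "ệ").
        normalized = unicodedata.normalize("NFC", word.lower())
        lexicon[word] = [ch for ch in normalized if not ch.isspace()]
    return lexicon


if __name__ == "__main__":
    # Print entries in a simple "word unit unit ..." lexicon layout.
    for word, units in graphemic_lexicon(["việt", "nam"]).items():
        print(word, " ".join(units))
```

For scripts such as Khmer, where one written unit can span several code points, a real system would likely need a script-specific segmentation step in place of the per-character split used here.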


Bibliographic details

Source
Conference on Speech Technology and Human-Computer Dialogue | 2007 | 14 pages
Authors
Laurent BESACIER; Viet-Bac LE
Original format: PDF
Chinese Library Classification: TN912-53
Keywords
Speech recognition; Under-resourced languages; Acoustic model bootstrapping; Sub-word units for language modelling
