Modeling under-resourced languages for speech recognition

Kurimo Mikko; Enarvi Seppo; Tilk Ottokar; Varjokallio Matti; Mansikkaniemi Andre; Alumae Tanel

首页> 外文期刊>Language Resources and Evaluation >Modeling under-resourced languages for speech recognition

【24h】

Modeling under-resourced languages for speech recognition

机译：为语音识别建模资源不足的语言

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

One particular problem in large vocabulary continuous speech recognition for low-resourced languages is finding relevant training data for the statistical language models. Large amount of data is required, because models should estimate the probability for all possible word sequences. For Finnish, Estonian and the other fenno-ugric languages a special problem with the data is the huge amount of different word forms that are common in normal speech. The same problem exists also in other language technology applications such as machine translation, information retrieval, and in some extent also in other morphologically rich languages. In this paper we present methods and evaluations in four recent language modeling topics: selecting conversational data from the Internet, adapting models for foreign words, multi-domain and adapted neural network language modeling, and decoding with subword units. Our evaluations show that the same methods work in more than one language and that they scale down to smaller data resources.

机译：资源匮乏的语言在大词汇量连续语音识别中的一个特殊问题是找到统计语言模型的相关训练数据。由于模型应该估计所有可能的单词序列的概率，因此需要大量数据。对于芬兰语，爱沙尼亚语和其他芬诺语/俄语语言，数据存在一个特殊问题，那就是正常语音中常见的大量不同单词形式。在其他语言技术应用（例如机器翻译，信息检索）中以及在某种程度上在其他形态丰富的语言中也存在相同的问题。在本文中，我们介绍了四个最近的语言建模主题中的方法和评估：从Internet选择会话数据，为外来词改编模型，多域和改编的神经网络语言建模以及使用子词单元进行解码。我们的评估表明，相同的方法可以使用多种语言，并且可以缩小为较小的数据资源。

著录项

来源
《Language Resources and Evaluation》 |2017年第4期|961-987|共27页
作者
Kurimo Mikko; Enarvi Seppo; Tilk Ottokar; Varjokallio Matti; Mansikkaniemi Andre; Alumae Tanel;
展开▼
作者单位

Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland;

Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland;

Tallinn Univ Technol, Inst Cybernet, Tallinn, Estonia;

Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland;

Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland;

Tallinn Univ Technol, Inst Cybernet, Tallinn, Estonia;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Large vocabulary speech recognition; Statistical language modeling; Subword units; Data filtering; Adaptation;

机译：大词汇语音识别;统计语言建模;子词单元;数据过滤;自适应;

相似文献

外文文献
中文文献
专利

1. Speech recognition for under-resourced languages: Data sharing in hidden Markov model systems [J] . Febe de Wet, Neil Kleynhans, Dirk van Compernolle, South African Journal of Science . 2017,第1a2期

机译：资源不足语言的语音识别：隐马尔可夫模型系统中的数据共享
2. Semantic speech recognition in the Basque context Part Ⅱ: language identification for under-resourced languages [J] . Nora Barroso, Karmele Lopez de Ipina, Carmen Hernandez, International journal of speech technology . 2012,第1期

机译：巴斯克语境中的语义语音识别第二部分：资源匮乏语言的语言识别
3. Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages [J] . Van Hai DO, Xiong XIAO, Eng Siong CHNG, IEICE transactions on information and systems . 2014,第2期

机译：资源不足语言的大词汇语音识别的跨语言电话映射
4. Design of multi-feature class models for Speech Recognition Security systems with under-resourced languages [C] . Barroso N., de Ipina K. Lopez, Hernandez C., 2011 IEEE International Carnahan Conference on Security Technology . 2011

机译：资源不足语言的语音识别安全系统的多特征类模型设计
5. Arabic language modeling with stem-derived morphemes for automatic speech recognition. [D] . Heintz, Ilana. 2010

机译：具有词干衍生语素的阿拉伯语言建模，可实现自动语音识别。
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O] . Edvin Pakoci, Branislav Popović, Darko Pekar 2019

机译：在塞尔维亚大型词汇语音识别的语言建模中使用形态学数据
7. Modeling under-resourced languages for speech recognition [O] . Kurimo, Mikko, Enarvi, Seppo, Tilk, Ottokar, 2016

机译：为语音识别建模资源不足的语言
8. Robust Speech Processing & Recognition: Speaker ID, Language ID, Speech Recognition/Keyword Spotting, Diarization/Co-Channel/Environmental Characterization, Speaker State Assessment. [R] . Hansen, J. H. 2015

机译：强大的语音处理和识别：说话者ID，语言ID，语音识别/关键字识别，Diarization / Co-Channel /环境表征，说话者状态评估。

Modeling under-resourced languages for speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅