Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition

机译：在自动语音识别的情况下，在书面文字和语音记录上训练的语言模型的比较

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We investigate whether language models used in automatic speech recognition (ASR) should be trained on speech transcripts rather than on written texts. By calculating log-likelihood statistic for part-of-speech (POS) n-grams, we show that there are significant differences between written texts and speech transcripts. We also test the performance of language models trained on speech transcripts and written texts in ASR and show that using the former results in greater word error reduction rates (WERR), even if the model is trained on much smaller corpora. For our experiments we used the manually labeled one million subcorpus of the National Corpus of Polish and an HTK acoustic model.

机译：我们调查是否应在语音记录而非在书面文本上训练用于自动语音识别（ASR）的语言模型。通过计算词性（POS）n-gram的对数似然统计，我们发现书面文本和语音记录之间存在显着差异。我们还测试了在ASR中针对语音成绩单和书面文本训练的语言模型的性能，并表明，即使使用小得多的语料库训练，使用前者也会导致更大的单词错误减少率（WERR）。对于我们的实验，我们使用了波兰国家语料库的手动标记的一百万个子语料库和HTK声学模型。

著录项

来源
《Federated Conference on Computer Science and Information Systems》|2015年|193-197|共5页
会议地点
作者
Dziadzio Sebastian; Nabozny Aleksandra; Smywinski-Pohl Aleksander; Ziolko Bartosz;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
speech recognition; text analysis; ASR; HTK acoustic model; National Corpus of Polish; PO n-gram; WERR; automatic speech recognition; part-of-speech n-gram; texts speech transcript; word error reduction rate; Acoustics; Automatic speech recognition; Computational modeling; Computer science; Speech; Tagging; automatic speech recognition; morphosyntactic language model; written and spoken language comparison;

机译：语音识别;文本分析; ASR; HTK声学模型;波兰国家语料库; PO n-gram; WERR;自动语音识别;词性n-gram;文本语音成绩单;单词错误减少率;声学;自动语音识别;计算建模;计算机科学;语音;标记;自动语音识别;句法语言模型;书面和口头语言比较;

相似文献

外文文献
中文文献
专利

1. Comparison of Performance of Enhanced Morpheme-based Language Model with Different Word-based Language Models for Improving the Performance of Tamil Speech Recognition System [J] . S. SARASWATHI, T.V. GEETHA ACM transactions on Asian language information processing . 2007,第3期

机译：增强的基于词素的语言模型与不同的基于单词的语言模型的性能比较，以提高泰米尔语语音识别系统的性能
2. Discriminatively trained continuous Hindi speech recognition system using interpolated recurrent neural network language modeling [J] . Dua Mohit, Aggarwal R. K., Biswas Mantosh Neural computing & applications . 2019,第10期

机译：使用内插复发性神经网络语言建模判别训练的连续印地语语音识别系统
3. Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha [J] . Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu Journal of Signal and Information Processing . 2020,第1期

机译：Chaha非常低于资源语言的多语言深神经网络建模方法对自动语音识别系统的研究
4. Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition [C] . Dziadzio Sebastian, Nabozny Aleksandra, Smywinski-Pohl Aleksander, Federated Conference on Computer Science and Information Systems . 2015

机译：在自动语音识别的背景下，在书面文本和语音成绩单上培训的语言模型的比较
5. Investigating different models for cross-language information retrieval from automatic speech transcripts. [D] . Alzghool, Muath. 2009

机译：研究用于从自动语音笔录中获取跨语言信息的不同模型。
6. A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech [O] . Jodi Kodish-Wachs, Emin Agassi, Patrick Kenny III, 2018

机译：当代自动语音识别引擎用于对话式临床语音的系统比较
7. Comparison Of Part-Of-Speech And Automatically Derived Category-Based Language Models For Speech Recognition [O] . T.R. Niesler, E. W. D. Whittaker, P.C. Woodland 1998

机译：语音识别的词性和自动派生基于类别的语言模型的比较

Comparison of language models trained on written texts and speech transcripts in the context of automatic speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅