HMM-based Speech Synthesis System incorporated with Language Identification for Low-resourced Languages

机译：基于HMM的语音合成系统，结合了低资源语言的语言识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Text-to-speech (TTS) synthesis systems are of benefit towards learning new or foreign languages. These systems are currently available for various major languages but not available for low-resourced languages. Scarcity of these systems may lead to challenges in learning new languages specifically low-resourced languages. Development of language-specific systems like TTS and Language identification (LID) have an important task to address in mitigating the historical linguistic effects of discrimination and domination imposed onto low-resourced indigenous languages. This paper presents the development of a multi-language LID+TTS synthesis system that generate audio of input text using the predicted language in four South African languages, namely: Tshivenda, Sepedi, Xitsonga and IsiNdebele. On the front-end, is the LID module that detects language of the input text before the TTS synthesis module produces output audio. The LID module is trained on a 4 million words dataset resulted with 99% accuracy outperforming the state-of-the-art systems. A robust method for building TTS voices called hidden Markov model method is used to build new voices in the selected languages. The quality of the voices is measured using the mean opinion score and word error rate metrics that resulted with positive results on the understandability, naturalness, pleasantness, intelligibility and overall impression of the system of the newly created TTS voices. The system is available as a website service.

机译：文本到语音（TTS）合成系统对学习新的或外语有益。这些系统目前可用于各种主要语言，但不适用于低资源语言。这些系统的稀缺可能导致学习新语言的挑战专门低资源的语言。特定于TTS和语言识别（LID）等语言特定系统的开发有一个重要的任务，可以解决减轻鉴别和统治的历史语言影响，施加到低资源的土着语言。本文介绍了多语言盖+ TTS综合系统的开发，使用四种南非语言中的预测语言生成输入文本的音频，即：Tshivenda，Sepedi，Xitsonga和Isindebele。在前端，是在TTS合成模块产生输出音频之前检测输入文本的语言的盖模块。盖模块培训，在400万字数据上培训，导致99％的精度优于最先进的系统。用于构建名为Hidden Markov Model方法的TTS声音的强大方法用于在所选语言中构建新的声音。使用平均意见分数和单词错误率指标来测量声音的质量，导致积极的结果对新创建的TTS声音系统的可理解性，自然，愉悦度，可懂度和整体印象产生积极的结果。该系统可作为网站服务提供。

著录项

来源
《International Conference on Advances in Big Data, Computing and Data Communication Systems》|2019年|1-6|共6页
会议地点
作者
Tshephisho Joseph Sefara; Tumisho Billson Mokgonyane; Madimetja Jonas Manamela; Thipe Isaiah Modipa;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Hidden Markov models; Speech synthesis; Tools; Dictionaries; Training; Computer science;

机译：隐马尔可夫模型;语音合成;工具;词典;培训;计算机科学;

相似文献

外文文献
中文文献
专利

1. Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification [J] . Basu Joyanta, Khan Soma, Roy Rajib, Circuits, systems and signal processing . 2021,第10期

机译：用于扬声器和语言识别的低资源东部和东北印度语言语言的多语种演讲语料库
2. Semantic speech recognition in the Basque context Part Ⅱ: language identification for under-resourced languages [J] . Nora Barroso, Karmele Lopez de Ipina, Carmen Hernandez, International journal of speech technology . 2012,第1期

机译：巴斯克语境中的语义语音识别第二部分：资源匮乏语言的语言识别
3. Cross-language identification of long-term average speech spectra in korean and english: Toward a better understanding of the quantitative difference between two languages [J] . NohH., LeeD.-H. Ear and hearing. . 2012,第3期

机译：跨语言识别韩语和英语的长期平均语音频谱：更好地理解两种语言之间的数量差异
4. HMM-based Speech Synthesis System incorporated with Language Identification for Low-resourced Languages [C] . Tshephisho Joseph Sefara, Tumisho Billson Mokgonyane, Madimetja Jonas Manamela, International Conference on Advances in Big Data, Computing and Data Communication Systems . 2019

机译：基于HMM的语音合成系统，并包含低资源语言的语言识别
5. Text-to-Speech Synthesis Using Found Data for Low-Resource Languages [D] . Cooper, Erica 2019

机译：使用低资源语言的数据对文本进行语音合成
6. Electrophysiological evidence of functional integration between the language and motor systems in the brain: A study of the speech Bereitschaftspotential [O] . J.J. McArdle, Z. Mari, R.H. Pursley, -1

机译：大脑语言和运动系统之间功能整合的电生理证据：语音Bereitschaftspotential的研究
7. HMM-based Mixed-language (Mandarin-English) Speech Synthesis [O] . Yao Qian, Houwei Cao, Frank K. Soong 2013

机译：基于Hmm的混合语言（普通话 - 英语）语音合成
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

HMM-based Speech Synthesis System incorporated with Language Identification for Low-resourced Languages

摘要

著录项

相似文献

相关主题

期刊订阅