Data Mining Approach for Prosody Modelling by ANN in Text-to-Speech Synthesis

Abstract

This contribution describes an artificial neural network (ANN) approach to modelling the fundamental frequency and the duration of speech units in text-to-speech (TTS) synthesis. We investigate methods for extracting knowledge from existing speech databases and for minimising the number of optimised neural network parameters in order to improve the generalisation ability of the ANN. Our particular aim is to improve the quality of prosody. The ANNs that model these two prosody parameters for TTS synthesis are trained on natural speech. We applied the GUHA method (General Unary Hypotheses Automaton) [1] to select the most important input parameters, and a standard ANN pruning process [9] to optimise the generalisation ability.
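
The abstract does not say which pruning procedure [9] denotes. Purely as an illustration of how pruning reduces the number of network parameters, the sketch below uses magnitude-based pruning, a common technique swapped in here as an assumption and not necessarily the authors' method. The function name `prune_by_magnitude` and the example weight values are hypothetical.

```python
import numpy as np


def prune_by_magnitude(weights, fraction):
    """Zero out the given fraction of weights with the smallest magnitude.

    Illustrative assumption only: the source does not specify its pruning rule.
    Returns the pruned weight array and a boolean mask of surviving weights.
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be in [0, 1]")

    flat = np.abs(weights).ravel()
    n_prune = int(round(fraction * flat.size))
    mask = np.ones(flat.size, dtype=bool)

    if n_prune > 0:
        # argpartition places the n_prune smallest magnitudes in the first slots
        prune_idx = np.argpartition(flat, n_prune - 1)[:n_prune]
        mask[prune_idx] = False

    mask = mask.reshape(weights.shape)
    return weights * mask, mask


if __name__ == "__main__":
    # Hypothetical weight matrix of one small layer (2 units x 3 inputs)
    w = np.array([[0.5, -0.01, 2.0],
                  [0.02, -1.5, 0.3]])
    pruned, mask = prune_by_magnitude(w, fraction=0.5)
    print("surviving weights:", int(mask.sum()), "of", mask.size)
    print(pruned)
```

In practice the network would normally be retrained after each pruning step, and the pruning fraction chosen by checking error on held-out data; neither step is shown here.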

Bibliographic record

Source: IASTED International Conference on Artificial Intelligence and Applications, 2001, 6 pages
Authors: Jana Tuckova; Vaclav Sebesta; International Association of Science and Technology for Development
Original format: PDF
Chinese Library Classification: Artificial intelligence theory
Keywords: signal and image processing; prosody modelling; application of neural networks; data mining

Similar literature

1. Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks [J]. Reddy V. Ramu, Rao K. Sreenivasa. Neurocomputing. 2016, issue JANa1

2. Modeling stylized invariance and local variability of prosody in text-to-speech synthesis [J]. Chu M, Zhao Y, Chang E. Speech Communication. 2006, issue 6

3. A rule based prosody model for Turkish text-to-speech synthesis [J]. Uslu Ibrahim Baran, Ilk Hakki Gokhan, Yilmaz Asim Egemen. Technical Gazette. 2013, issue 2

4. Data Mining Approach for Prosody Modelling by ANN in Text-to-Speech Synthesis [C]. Jana Tuckova, Vaclav Sebesta. Artificial Intelligence and Applications. 2001

5. Building a prosodically sensitive diphone database for a Korean text-to-speech synthesis system [D]. Yoon, Kyuchul. 2005

6. A data mining approach for modeling churn behavior via RFM model in specialized clinics. Case study: A public sector hospital in Tehran [O]. Mehdi Mohammadzadeh, Zeinab Zare Hoseini, Hamid Derafshi

7. Modeling Prosody Patterns for Chinese Expressive Text-to-Speech Synthesis [O]. Zhiyong Wu, Lianhong Cai, Helen M. Meng. 2015

