A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis

机译：使用DCT应用于F0参数化的超导模型，用于语音合成文本

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper addresses the idea of the superpositional model based on the DCT (Discrete Cosine Transform) parameterization of the F0 contours. We examine the capacity of the DCT coefficients to estimate the fast variations in the F0 contour at syllable level and also the overall trend of the phrase level. The method determines the coefficients at syllable level, based on the subtraction of the estimated phrase level contour from the original one; thus considering that the syllable has an additive prosodic effect over the phrase level. We also compare the use of 3 different decision and regression tree algorithms for DCT coefficients clustering and prediction. Additional features are selected based on a greedy stepwise without backtracking feature selection method. The results support the proposed method through low average square errors and little or no perceivable errors in the synthesized speech.

机译：本文根据F0轮廓的DCT（离散余弦变换）参数化来解决超定位模型的思想。我们检查DCT系数的容量，以估计音节水平的F0轮廓中的快速变化以及短语级别的整体趋势。该方法基于来自原始估计短语级别轮廓的减法来确定音节级别的系数; 因此，考虑到音节对短语水平具有添加剂韵律效应。我们还比较3不同决策和回归树算法的DCT系数聚类和预测。基于逐步选择的其他功能，无需回溯特征选择方法。结果通过低平均平均误差支持所提出的方法，并且在合成语音中很少或没有可感知的误差。

著录项

来源
《Conference on Speech Technology and Human-Computer Dialogue》|2011年||共6页
会议地点
作者
Stan Adriana; Giurgiu Mircea;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
DCT; F0 modelling; pitch; prosody;

机译：DCT;F0建模;俯仰;韵律;

相似文献

外文文献
中文文献
专利

1. Superpositional HMM-Based Intonation Synthesis Using a Functional F0 Model [J] . Ni Jinfu, Shiga Yoshinori, Hori Chiori Journal of signal processing systems for signal, image, and video technology . 2016,第2期

机译：使用功能性F0模型的基于HMM的叠加音调合成
2. F0 Contour Modeling for Arabic Text-to-Speech Synthesis Using Fujisaki Parameters and Neural Networks [J] . Fatouma Boukadida, Noureddine Ellouze, Zied Mnasri Signal Processing: An International Journal . 2011,第6期

机译：使用Fujisaki参数和神经网络的F0轮廓建模，用于阿拉伯文本到语音的合成
3. Neural network-based F0 text-to-speech synthesiser for Mandarin [J] . Hwang S.-H., Chen S.-H. IEE Proceedings. Part K . 1994,第6期

机译：基于神经网络的普通话F0语音合成器
4. A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis [C] . Stan Adriana, Giurgiu Mircea Proceedings of the 6th International Conference on Speech Technology and Human-Computer Dialogue . 2011

机译：叠加模型应用于使用DCT进行F0参数化的文本到语音合成
5. Improving high quality concatenative text-to-speech synthesis using the circular linear prediction model. [D] . Shukla, Sunil Ravindra. 2007

机译：使用圆形线性预测模型改善高质量的串联文本到语音合成。
6. A Parameterized Model of Amylopectin Synthesis Provides Key Insights into the Synthesis of Granular Starch [O] . Alex Chi Wu, Matthew K. Morell, Robert G. Gilbert -1

机译：支链淀粉合成的参数化模型提供了颗粒淀粉合成的关键见解
7. DCT-BASED AMPLITUDE AND FREQUENCY MODULATED HARMONIC-PLUS-NOISE MODELLING FOR TEXT-TO-SPEECH SYNTHESIS [O] . Kris Hermus, Hugo Van Hamme, Werner Verhelst, 2010

机译：用于文本到语音合成的基于DCT的幅度和频率调制的谐波 - 噪声 - 噪声建模

A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis

摘要

著录项

相似文献

相关主题

期刊订阅