首页> 外文会议>Conference on Speech Technology and Human-Computer Dialogue >A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis
【24h】

A superpositional model applied to F0 parameterization using DCT for text-to-speech synthesis

机译:使用DCT应用于F0参数化的超导模型,用于语音合成文本

获取原文

摘要

This paper addresses the idea of the superpositional model based on the DCT (Discrete Cosine Transform) parameterization of the F0 contours. We examine the capacity of the DCT coefficients to estimate the fast variations in the F0 contour at syllable level and also the overall trend of the phrase level. The method determines the coefficients at syllable level, based on the subtraction of the estimated phrase level contour from the original one; thus considering that the syllable has an additive prosodic effect over the phrase level. We also compare the use of 3 different decision and regression tree algorithms for DCT coefficients clustering and prediction. Additional features are selected based on a greedy stepwise without backtracking feature selection method. The results support the proposed method through low average square errors and little or no perceivable errors in the synthesized speech.
机译:本文根据F0轮廓的DCT(离散余弦变换)参数化来解决超定位模型的思想。 我们检查DCT系数的容量,以估计音节水平的F0轮廓中的快速变化以及短语级别的整体趋势。 该方法基于来自原始估计短语级别轮廓的减法来确定音节级别的系数; 因此,考虑到音节对短语水平具有添加剂韵律效应。 我们还比较3不同决策和回归树算法的DCT系数聚类和预测。 基于逐步选择的其他功能,无需回溯特征选择方法。 结果通过低平均平均误差支持所提出的方法,并且在合成语音中很少或没有可感知的误差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号