【24h】

Multipulse Sequences for Residual Signal Modeling

机译:用于残留信号建模的多脉冲序列

获取原文

摘要

In source-filter models of speech production, the residual signal - what remains after passing the speech signal through the inverse filter - contains important information for the generation of naturally sounding re-synthesized speech. Typically, the voiced regions of residual signals are regarded as a mixture of glottal pulse and noise. This paper introduces a novel approach to represent the noise component of voiced regions of residual signals through autoregressive filtering of multipulse sequences. The positions and amplitudes of the non-zero samples of these multipulse signals are optimized through a closed-loop procedure. The method in question is applied to excitation modeling in statistical parametric synthesis. Experimental results indicate that the use of multipulse-based noise component construction eliminates the necessity of run-time ad hoc procedures such as high-pass filtering and time modulation, common on excitation models for statistical parametric synthesizers, with no loss of synthesized speech quality.
机译:在语音产生的源滤波器模型中,残留信号-在将语音信号通过反滤波器后剩下的-包含重要信息,用于生成自然听起来重新合成的语音。通常,残留信号的浊音区域被认为是声门脉冲和噪声的混合。本文介绍了一种通过对多脉冲序列进行自回归滤波来表示残差信号浊音区域的噪声分量的新颖方法。这些多脉冲信号的非零采样的位置和幅度通过闭环过程进行了优化。所讨论的方法应用于统计参数综合中的激​​励建模。实验结果表明,使用基于多脉冲的噪声分量构造消除了运行时自组织程序(如高通滤波和时间调制)的必要性,这是统计参数合成器的激励模型上常见的,并且不会损失合成语音质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号