...
首页> 外文期刊>Computer speech and language >Speaker adaptive voice source modeling with applications to speech coding and processing
【24h】

Speaker adaptive voice source modeling with applications to speech coding and processing

机译:说话人自适应语音源建模及其在语音编码和处理中的应用

获取原文
获取原文并翻译 | 示例
           

摘要

We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter identification procedures are illustrated. The model and the identification procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then acting on the physically oriented parameters of the voice source. The class of models proposed provides in principle a tool for both the estimation of glottal source signals, and the encoding of the speech signal for transformation purposes. The application of this model to time stretching and to fundamental frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually very similar to the target, and that time stretching and "pitch extrapolation" effects can be obtained by simple control strategies.
机译:我们讨论了语音源的低维物理模型在语音编码和处理应用中的使用。说明了一类波形自适应动态声门模型和参数识别过程。通过处理记录的语音上的信号转换来评估模型和识别过程,可以通过将模型拟合到数据,然后对语音源的物理定向参数进行操作来实现。所提出的模型类别原则上为声门信号的估计和语音信号的编码提供了一种工具,以进行转换。还说明了该模型在时间拉伸和基本频率控制(音调偏移)中的应用。实验表明,复制合成在感觉上与目标非常相似,并且可以通过简单的控制策略获得时间延长和“音高外推”效果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号