Speaker adaptive voice source modeling with applications to speech coding and processing

Carlo Drioli; Andrea Calanca

Abstract

We discuss the use of low-dimensional physical models of the voice source for speech coding and processing applications. A class of waveform-adaptive dynamic glottal models and parameter identification procedures are illustrated. The model and the identification procedures are assessed by addressing signal transformations on recorded speech, achievable by fitting the model to the data, and then acting on the physically oriented parameters of the voice source. The class of models proposed provides in principle a tool for both the estimation of glottal source signals, and the encoding of the speech signal for transformation purposes. The application of this model to time stretching and to fundamental frequency control (pitch shifting) is also illustrated. The experiments show that copy synthesis is perceptually very similar to the target, and that time stretching and "pitch extrapolation" effects can be obtained by simple control strategies.

Bibliographic details

Source
Computer speech and language, 2014, issue 5, pp. 1195-1208 (14 pages)
Authors
Carlo Drioli; Andrea Calanca
Affiliations

Department of Mathematics and Computer Science, University of Udine, Udine, Italy;

Department of Computer Science, University of Verona, Verona, Italy;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Glottal modeling; Model inversion; Model-based transformations; Speech synthesis and processing

Similar literature

1. Speech separation using speaker-adapted eigenvoice speech models [J]. Ron J. Weiss, Daniel P.W. Ellis. Computer speech and language, 2010, issue 1.
2. Average-Voice-Based Speech Synthesis Using HSMM-Based Speaker Adaptation and Adaptive Training [J]. Junichi Yamagishi, Takao Kobayashi. IEICE Transactions on Information and Systems, 2007, issue 2.
3. Low-complexity source coding using Gaussian mixture models, lattice vector quantization, and recursive coding with application to speech spectrum quantization [J]. Subramaniam A.D., Gardner W.R., Rao B.D. IEEE transactions on audio, speech and language processing, 2006, issue 2.
4. A voiced/unvoiced classified vector quantized speech transform coder implemented on a TMS 32020 signal processor [C]. Fjallbrant, T., Mekuria. 1988.
5. The Voice Source in Speech Production: From Models to Applications [D]. Chen, Gang. 2014.
6. Phoneme restoration and empirical coverage of interactive activation and adaptive resonance models of human speech processing [O]. James S. Magnuson.
7. Speaker adaptive voice source modeling with applications to speech coding and processing [O]. Carlo Drioli, Andrea Calanca. 2014.
