Photo-Realistic Mouth Animation Based on an Asynchronous Articulatory DBN Model for Continuous Speech

机译：基于异步发音DBN模型的连续语音的逼真的口部动画

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes a continuous speech driven photo realistic visual speech synthesis approach based on an articulatory dynamic Bayesian network model (AF_AVDBN) with constrained asynchrony. In the training of the AF_AVDBN model, the perceptual linear prediction (PLP) features and YUV features are extracted as acoustic and visual features respectively. Given an input speech and the trained AF_AVDBN parameters, an EM-based algorithm is deduced to learn the optimal YUV features, which are then used, together with the compensated high frequency components, to synthesize the mouth animation corresponding to the input speech. In the experiments, mouth animations are synthesized for 80 connected digit speech sentences. Both qualitative and quantitative evaluation results show that the proposed method is capable of synthesizing more natural, clear and accurate mouth animations than those from the state asynchronous DBN model (S_A_DBN).

机译：本文提出了一种基于异步约束的动态贝叶斯网络模型（AF_AVDBN）的连续语音驱动的照片逼真的视觉语音合成方法。在AF_AVDBN模型的训练中，分别将感知线性预测（PLP）特征和YUV特征提取为声学和视觉特征。给定输入语音和训练有素的AF_AVDBN参数，推导基于EM的算法以学习最佳YUV特征，然后将其与补偿的高频分量一起用于合成与输入语音相对应的嘴部动画。在实验中，为80个相连的数字语音句子合成了嘴部动画。定性和定量评估结果均表明，与状态异步DBN模型（S_A_DBN）相比，该方法能够合成更自然，清晰和准确的嘴部动画。

著录项

来源
《Asia-Pacific Signal and Information Processing Association Annual Summit and Conference》|2011年|1-4|共4页
会议地点
作者
He Zhang; Dongmei Jiang; Peng Wu; Hichem Sahli;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算机的应用;信号处理;
关键词

相似文献

外文文献
中文文献
专利

1. Speech driven photo realistic facial animation based on an articulatory DBN model and AAM features [J] . Dongmei Jiang, Yong Zhao, Hichem Sahli, Multimedia Tools and Applications . 2014,第1期

机译：基于发音DBN模型和AAM功能的语音驱动的照片逼真的面部动画
2. Articulatory feature based continuous speech recognition using probabilistic lexical modeling [J] . Ramya Rasipuram, Mathew Magimai.-Doss Computer speech and language . 2016,第Mara期

机译：基于发音特征的概率词汇建模的连续语音识别
3. Realistic Mouth-Synching for Speech-Driven Talking Face Using Articulatory Modelling [J] . Lei Xie, Zhi-Qiang Liu IEEE transactions on multimedia . 2007,第期

机译：使用发音模型对语音驱动的说话人脸进行逼真的嘴部同步
4. Photo-Realistic Mouth Animation Based on an Asynchronous Articulatory DBN Model for Continuous Speech [C] . He Zhang, Dongmei Jiang, Peng Wu, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2011

机译：基于异步剖视DBN模型的光学 - 现实嘴动画，用于连续语音
5. Articulatory speech synthesis and speech production modelling. [D] . Huang, Jun. 2001

机译：发音语音合成和语音产生建模。
6. A study of acoustic-to-articulatory inversion of speech by analysis-by-synthesis using chain matrices and the Maeda articulatory model [O] . Sankaran Panchapagesan, Abeer Alwan -1

机译：使用链矩阵和前田发音模型通过合成分析对语音进行语音到发音发音转换的研究
7. PHMM BASED ASYNCHRONOUS ACOUSTIC MODEL FOR CHINESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION [O] . Hao Wu, Xihong Wu, Huisheng Chi 2016

机译：基于pHmm的异步声学模型用于中国大型词汇连续语音识别

Photo-Realistic Mouth Animation Based on an Asynchronous Articulatory DBN Model for Continuous Speech

摘要

著录项

相似文献

相关主题

期刊订阅