首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >Photo-Realistic Mouth Animation Based on an Asynchronous Articulatory DBN Model for Continuous Speech
【24h】

Photo-Realistic Mouth Animation Based on an Asynchronous Articulatory DBN Model for Continuous Speech

机译:基于异步发音DBN模型的连续语音的逼真的口部动画

获取原文

摘要

This paper proposes a continuous speech driven photo realistic visual speech synthesis approach based on an articulatory dynamic Bayesian network model (AF_AVDBN) with constrained asynchrony. In the training of the AF_AVDBN model, the perceptual linear prediction (PLP) features and YUV features are extracted as acoustic and visual features respectively. Given an input speech and the trained AF_AVDBN parameters, an EM-based algorithm is deduced to learn the optimal YUV features, which are then used, together with the compensated high frequency components, to synthesize the mouth animation corresponding to the input speech. In the experiments, mouth animations are synthesized for 80 connected digit speech sentences. Both qualitative and quantitative evaluation results show that the proposed method is capable of synthesizing more natural, clear and accurate mouth animations than those from the state asynchronous DBN model (S_A_DBN).
机译:本文提出了一种基于异步约束的动态贝叶斯网络模型(AF_AVDBN)的连续语音驱动的照片逼真的视觉语音合成方法。在AF_AVDBN模型的训练中,分别将感知线性预测(PLP)特征和YUV特征提取为声学和视觉特征。给定输入语音和训练有素的AF_AVDBN参数,推导基于EM的算法以学习最佳YUV特征,然后将其与补偿的高频分量一起用于合成与输入语音相对应的嘴部动画。在实验中,为80个相连的数字语音句子合成了嘴部动画。定性和定量评估结果均表明,与状态异步DBN模型(S_A_DBN)相比,该方法能够合成更自然,清晰和准确的嘴部动画。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号