Conference: Image and Graphics, 2009. ICIG '09

Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony

Abstract

This paper presents a method for constructing mouth animation based on audio-visual DBN models with articulatory features (AF_AVDBN). In these models, the articulatory features of the lips, tongue, and glottis/velum may be asynchronous within a maximum asynchrony constraint, which describes the speech production process more reasonably. Given an audio input and the trained AF_AVDBN models, an optimal visual feature learning algorithm is derived under the Maximum Likelihood Estimation criterion. The learned visual features are then used to construct the mouth images for the input speech. Objective and subjective evaluations on the mouth animations of 110 speech sentences show that the visual features learned from the AF_AVDBN models track the real visual features very closely, and that the mouth images constructed from the AF_AVDBN models closely resemble the real ones.
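The abstract does not say how the maximum asynchrony constraint is measured. As a minimal sketch, assume each articulatory stream (lips, tongue, glottis/velum) advances through an integer position index within a word, and that a joint configuration is allowed when the spread between the most and least advanced streams does not exceed a bound. The function name `allowed_joint_positions`, the stream labels, and the parameters `n_positions` and `max_async` are hypothetical and are not taken from the paper.

```python
from itertools import product


def allowed_joint_positions(n_positions, max_async,
                            streams=("lips", "tongue", "glottis_velum")):
    """Enumerate joint stream positions whose spread is within max_async.

    Assumption (not from the paper): asynchrony is the difference between
    the largest and smallest position index across the streams.
    """
    allowed = []
    for combo in product(range(n_positions), repeat=len(streams)):
        # Keep the configuration only if no stream runs too far ahead.
        if max(combo) - min(combo) <= max_async:
            allowed.append(dict(zip(streams, combo)))
    return allowed


if __name__ == "__main__":
    # A bound of 0 forces full synchrony; larger bounds admit more states.
    for bound in (0, 1, 2):
        print(bound, len(allowed_joint_positions(3, bound)))
```

With three positions per stream, a bound of 0 leaves only the fully synchronous configurations, while a bound equal to the largest possible spread removes the constraint entirely. Under this assumption, the bound controls how far the joint state space grows beyond the synchronous case.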
