Video Realistic Mouth Animation Based on an Audio Visual DBN Model with Articulatory Features and Constrained Asynchrony

Abstract

This paper presents a method for constructing mouth animation based on DBN models with articulatory features (AF_AVDBN), in which the articulatory features of the lips, tongue, and glottis/velum can be asynchronous within a maximum asynchrony constraint, describing the speech production process more reasonably. Given an audio input and the trained AF_AVDBN models, an optimal visual feature learning algorithm is derived under the Maximum Likelihood Estimation criterion. The learned visual features are then used to construct the mouth images for the input speech. Objective and subjective evaluations of the mouth animations for 110 speech sentences show that the visual features learned from the AF_AVDBN models track the real visual features very closely, and that the mouth images constructed from the AF_AVDBN models closely resemble the real ones.
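
The constrained-asynchrony idea in the abstract can be illustrated with a minimal sketch. It assumes each articulatory stream holds an integer position within the word, and that the constraint bounds the gap between the most and least advanced streams. The abstract does not give the paper's exact formulation, so this is only one plausible reading; the names `STREAMS`, `within_asynchrony`, and `allowed_configurations` are hypothetical.

```python
from itertools import product

# Stream names taken from the abstract; everything else here is an assumption.
STREAMS = ("lips", "tongue", "glottis_velum")


def within_asynchrony(positions, max_async):
    """Return True if no two streams are more than max_async positions apart."""
    return max(positions) - min(positions) <= max_async


def allowed_configurations(num_positions, max_async):
    """List every joint position tuple that satisfies the asynchrony bound."""
    return [
        combo
        for combo in product(range(num_positions), repeat=len(STREAMS))
        if within_asynchrony(combo, max_async)
    ]


if __name__ == "__main__":
    configs = allowed_configurations(num_positions=4, max_async=1)
    total = 4 ** len(STREAMS)
    print(f"{len(configs)} of {total} joint configurations are allowed")
```

A bound of this kind shrinks the joint state space: with `max_async=0` the streams are forced to stay in lockstep, while larger values let one articulator run ahead of the others by a limited amount.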

Bibliographic details

Source
Image and Graphics, 2009. ICIG '09 | 2010 | pp. 658-662 | 5 pages
Authors
Dongmei Jiang; Peizhen Liu; Ravyse I.; Sahli H.; Verhelst W.;
Original format: PDF
Chinese Library Classification: TP391.41
Keywords
AF_AVDBN; articulatory features; asynchrony; mouth animation;
