Dynamic visual features for audio-visual speaker verification

David Dean; Sridha Sridharan

首页> 外文期刊>Computer speech and language >Dynamic visual features for audio-visual speaker verification

【24h】

Dynamic visual features for audio-visual speaker verification

机译：动态视觉功能，用于视听扬声器验证

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The cascading appearance-based (CAB) feature extraction technique has established itself as the state-of-the-art in extracting dynamic visual speech features for speech recognition. In this paper, we will focus on investigating the effectiveness of this technique for the related speaker verification application. By investigating the speaker verification ability of each stage of the cascade we will demonstrate that the same steps taken to reduce static speaker and environmental information for the visual speech recognition application also provide similar improvements for visual speaker recognition. A further study is conducted comparing synchronous HMM (SHMM) based fusion of CAB visual features and traditional perceptual linear predictive (PLP) acoustic features to show that higher complexity inherit in the SHMM approach docs not appear to provide any improvement in the final audio-visual speaker verification system over simpler utterance level score fusion.

机译：基于级联外观的（CAB）特征提取技术已成为提取动态视觉语音特征进行语音识别的最新技术。在本文中，我们将重点研究该技术在相关说话人验证应用中的有效性。通过研究级联每个阶段的说话人验证能力，我们将证明减少视觉语音识别应用程序的静态说话人和环境信息的相同步骤也为视觉说话人识别提供了类似的改进。进行了进一步的研究，比较了基于同步HMM（SHMM）的CAB视觉特征与传统感知线性预测（PLP）声学特征的融合，以显示SHMM方法文档中继承的较高复杂性似乎无法为最终的视听提供任何改进说话人验证系统，通过更简单的话语水平得分融合。

著录项

来源
《Computer speech and language》 |2010年第2期|136-149|共14页
作者
David Dean; Sridha Sridharan;
展开▼
作者单位

Speech, Audio, Image and Video Research Laboratory, Queensland University of Technology, George St., Brisbane, Australia;

Speech, Audio, Image and Video Research Laboratory, Queensland University of Technology, George St., Brisbane, Australia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
audio-visual speaker recognition; cascading appearance-based features; synchronous hidden markov models;

机译：视听说话人识别;级联基于外观的功能;同步隐藏马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. Omnidirectional Audio-Visual Talker Localization Based on Dynamic Fusion of Audio-Visual Features Using Validity and Reliability Criteria [J] . Yuki DENDA, Takanobu NISHIURA, Yoichi YAMASHITA IEICE Transactions on Information and Systems . 2008,第3期

机译：基于有效性和可靠性准则的视听特征动态融合的全向视听讲话者定位
2. Audio-Visual Based Multi-Sample Fusion to Enhance Correlation Filters Speaker Verification System [J] . Dzati Athiar Ramli, Salina Abdul Samad, Aini Hussain International Journal on Computer Science and Engineering . 2010,第4期

机译：基于视听的多样本融合以增强相关滤波器说话者验证系统
3. Feature Fusion Based Audio-Visual Speaker Identification Using Hidden Markov Model under Different Lighting Variations [J] . Md. RabiulIslam, Md. AbdusSobhan Applied computational intelligence and soft computing . 2014,第1期

机译：不同光照变化下基于隐马尔可夫模型的基于特征融合的视听说话人识别
4. Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features [C] . Niall Fox, Richard B. Reilly 4th International Conference on Audio-and Video-Based Biometric Person Authentication AVBPA 2003 Jun 9-11, 2003 Guildford, UK . 2003

机译：基于动态视听特征的视听说话人识别
5. Robust speech processing based on microphone array, audio-visual, and frame selection for in-vehicle speech recognition and in-set speaker recognition. [D] . Zhang, Xianxian. 2005

机译：基于麦克风阵列，视听和帧选择的强大语音处理功能，可实现车载语音识别和内置说话人识别。
6. Multimodal Speaker Diarization Using a Pre-Trained Audio-Visual Synchronization Model [O] . Rehan Ahmad, Syed Zubair, Hani Alquhayz, 2019

机译：使用预训练的视听同步模型进行多模态扬声器二分法
7. Dynamic visual features for audio-visual speaker verification [O] . Dean David B., Sridharan Sridha 2010

机译：动态视觉功能，用于视听扬声器验证

Dynamic visual features for audio-visual speaker verification

摘要

著录项

相似文献

相关主题

期刊订阅