Audio-Visual Speech Recognition using Red Exclusion and Neural Networks

Abstract

Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrades in noisy environments. Audio-Visual Speech Recognition (AVSR) combats this by incorporating a visual signal into the recognition. This paper briefly reviews the contribution of psycholinguistics to this endeavour and the recent advances in machine AVSR. An important first step in AVSR is that of feature extraction from the mouth region, and a technique developed by the authors is briefly presented. This paper examines how useful this extraction technique is at the given task in combination with several integration architectures, demonstrates that vision does in fact assist speech recognition when used in a linguistically guided fashion, and gives insight into remaining issues.


Bibliographic record

Source
Twenty-Fifth Australasian Computer Science Conference; Jan/Feb, 2002; Monash University, Melbourne | 2002 | pp. 149-156 | 8 pages
Conference location: Melbourne (AU)
Authors
Trent W. Lewis; David M.W. Powers
Author affiliation

School of Informatics and Engineering, Flinders University of South Australia, PO Box 2100, Adelaide, South Australia 5001

Original format: PDF
Language: English
Chinese Library Classification: Information and knowledge dissemination
Keywords
audio-visual speech recognition; neural networks; sensor fusion


