首页> 外文会议>Twenty-Fifth Australasian Computer Science Conference; Jan/Feb, 2002; Monash University, Melbourne >Audio-Visual Speech Recognition using Red Exclusion and Neural Networks
【24h】

Audio-Visual Speech Recognition using Red Exclusion and Neural Networks

机译:使用红色排斥和神经网络的视听语音识别

获取原文
获取原文并翻译 | 示例

摘要

Automatic speech recognition (ASR) performs well under restricted conditions, but performance degrades in noisy environments. Audio-Visual Speech Recognition (AVSR) combats this by incorporating a visual signal into the recognition. This paper briefly reviews the contribution of psycholinguistics to this endeavour and the recent advances in machine AVSR. An important first step in AVSR is that of feature extraction from the mouth region and a technique developed by the authors is breifly presented. This paper examines examine how useful this extraction technique in combination with several integration arhitectures is at the given task, demonstrates that vision does infact assist speech recognition when used in a linguistically guided fashion, and gives insight remaining issues.
机译:自动语音识别(ASR)在受限条件下的性能很好,但是在嘈杂的环境中性能会下降。视听语音识别(AVSR)通过将视觉信号合并到识别中来解决此问题。本文简要回顾了心理语言学对这一努力的贡献以及机器AVSR的最新进展。 AVSR的重要第一步是从嘴巴区域提取特征,并简要介绍了作者开发的一种技术。本文研究了将提取技术与几种集成架构结合使用在给定任务上的有用性,证明了以语言指导的方式使用视觉时,视觉确实可以帮助语音识别,并给出了尚存的问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号