首页> 外文会议>Speech Technology and Human-Computer Dialogue, 2009. SpeD '09 >Linear interpolation of spectrotemporal excitation pattern representations for automatic speech recognition in the presence of noise
【24h】

Linear interpolation of spectrotemporal excitation pattern representations for automatic speech recognition in the presence of noise

机译:频谱时激励模式表示的线性插值用于在有噪声的情况下自动语音识别

获取原文

摘要

This article is based on the study of new methods to improve recognition capabilities of automatic speech recognition in the presence of noise systems. Instead of trying to modify complex recognition models, the study is aimed at enhancing the input data's reliability. This is achieved through processing of the acoustic representations of speech. One of these representations, called SpectroTemporal Excitation Pattern (STEP) is used in recognition systems with missing or unreliable data. One of the ideas behind this study was to increase the glimpsing areas in the STEP representations. And, because the glimpsing algorithm requires previous knowledge of the noise, another idea was to estimate noise characteristics, and base the glimpsing areas determination on these estimations. Preliminary tests were conducted with an HMM recognition system, but this will be the object of a future study.
机译:本文基于对新方法的研究,以提高存在噪声系统时自动语音识别的识别能力。该研究没有试图修改复杂的识别模型,而是旨在提高输入数据的可靠性。这是通过处理语音的声音表示来实现的。这些表示中的一种,称为光谱暂时性激发模式(STEP),用于数据缺失或不可靠的识别系统中。这项研究背后的想法之一是增加STEP表示中的瞥见区域。并且,由于瞥见算法需要事先了解噪声,因此另一个想法是估计噪声特征,并根据这些估计来确定瞥见区域。使用HMM识别系统进行了初步测试,但这将是未来研究的目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号