Linear interpolation of spectrotemporal excitation pattern representations for automatic speech recognition in the presence of noise


Abstract

This article studies new methods for improving the performance of automatic speech recognition systems in the presence of noise. Instead of modifying complex recognition models, the study aims to make the input data more reliable by processing the acoustic representations of speech. One such representation, the SpectroTemporal Excitation Pattern (STEP), is used in recognition systems that handle missing or unreliable data. One idea behind the study was to enlarge the glimpsing areas in the STEP representations. Because the glimpsing algorithm requires prior knowledge of the noise, a second idea was to estimate the noise characteristics and to determine the glimpsing areas from these estimates. Preliminary tests were conducted with a hidden Markov model (HMM) recognition system, but this will be the subject of a future study.
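
The abstract does not give the algorithm itself, so the following is only a minimal sketch of one possible reading of the two ideas, not the author's method. It assumes the STEP is a list of time frames, each a list of per-channel energies in dB; that the noise level of each channel can be estimated from a few leading frames taken to be speech-free; that a time-frequency point counts as a glimpse when it exceeds the noise estimate by a fixed threshold; and that non-glimpsed points are filled by linear interpolation along time, holding the nearest glimpse at the edges. All function names, the threshold, and the number of leading frames are hypothetical.

```python
def estimate_noise(step, n_frames=2):
    """Per-channel noise estimate in dB: mean of the first n_frames frames.

    Assumption: those leading frames contain noise only.
    """
    n_channels = len(step[0])
    return [sum(frame[c] for frame in step[:n_frames]) / n_frames
            for c in range(n_channels)]


def glimpse_mask(step, noise, threshold_db=3.0):
    """True where a point exceeds the channel noise estimate by threshold_db."""
    return [[energy - level > threshold_db
             for energy, level in zip(frame, noise)]
            for frame in step]


def interpolate_channel(values, reliable):
    """Fill unreliable points by linear interpolation between the nearest
    reliable neighbours in time; hold the nearest reliable value at the edges.
    Returns the input unchanged if no point is reliable."""
    out = list(values)
    idx = [t for t, ok in enumerate(reliable) if ok]
    if not idx:
        return out
    for t in range(len(values)):
        if reliable[t]:
            continue
        left = max((i for i in idx if i < t), default=None)
        right = min((i for i in idx if i > t), default=None)
        if left is None:
            out[t] = values[right]
        elif right is None:
            out[t] = values[left]
        else:
            w = (t - left) / (right - left)
            out[t] = (1 - w) * values[left] + w * values[right]
    return out


def fill_step(step, mask):
    """Apply interpolate_channel to every frequency channel of the STEP."""
    n_frames, n_channels = len(step), len(step[0])
    filled = [list(frame) for frame in step]
    for c in range(n_channels):
        column = interpolate_channel(
            [step[t][c] for t in range(n_frames)],
            [mask[t][c] for t in range(n_frames)],
        )
        for t in range(n_frames):
            filled[t][c] = column[t]
    return filled
```

A caller would chain the three steps: `noise = estimate_noise(step)`, `mask = glimpse_mask(step, noise)`, then `fill_step(step, mask)`. Interpolating along time only is a simplification chosen for brevity; interpolating across frequency channels as well would be an equally plausible variant.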


Bibliographic details

Source: Conference on Speech Technology and Human-Computer Dialogue, 2009, 6 pages
Original format: PDF
Chinese Library Classification: Electroacoustic technology and speech signal processing
Keywords
hidden Markov models; interpolation; speech recognition; automatic speech recognition; glimpsing algorithm; linear interpolation; noise systems; spectrotemporal excitation pattern representations; STEP representation; glimpsing; speech recognition in noise;


Similar literature


1. The role of binary mask patterns in automatic speech recognition in background noise [J]. Narayanan A., Wang D. The Journal of the Acoustical Society of America. 2013, No. 5aPta1

2. Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition [J]. Gemmeke J. F., Virtanen T., Hurmalainen A. IEEE Transactions on Audio, Speech, and Language Processing. 2011, No. 7

3. Improving eigenspace-based fuzzy logic system using a linear interpolation scheme for speech pattern recognition [J]. Ing-Jr Ding, Chih-Ta Yen. Transactions of the Canadian Society for Mechanical Engineering. 2013, No. 3

4. Linear interpolation of spectrotemporal excitation pattern representations for automatic speech recognition in the presence of noise [C]. Stan A. Speech Technology and Human-Computer Dialogue, 2009. SpeD '09. 2009

5. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition [D]. Wong, Brian. 2011

6. The role of binary mask patterns in automatic speech recognition in background noise [O]. Arun Narayanan, DeLiang Wang

7. Speech Recognition with Linear and Non-linear Amplification in the Presence of Industrial Noise [O]. Marcia Olson. 2000

8. Automatic Speech Recognition in the Presence of Co-Channel Speech Interference [R]. Lim, A. W. 1990
