Robust speech recognition and feature extraction using HMM2

Katrin Weber; Shajith Ikbal; Samy Bengio; Herve Bourlard

首页> 外文期刊>Computer speech and language >Robust speech recognition and feature extraction using HMM2

【24h】

Robust speech recognition and feature extraction using HMM2

机译：使用HMM2进行可靠的语音识别和特征提取

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the theoretical basis and preliminary experimental results of a new HMM model. referred to as HMM2, which can be considered as a mixture of HMMs. In this new model, the emission probabilities of the temporal (primary) HMM are estimated through secondary, state specific, HMMs working in the acoustic feature space. Thus, while the primary HMM is performing the usual time warping and integration, the secondary HMMs are responsible for extracting/modeling the possible feature dependencies, while performing frequency warping and integration. Such a model has several potential advantages, such as a more flexible modeling of the time/frequency structure of the speech signal. When working with spectral features, such a system can also perform nonlinear spectral warping, effectively implementing a form of nonlinear vocal tract normalization. Furthermore, it will be shown that HMM2 can be used to extract noise robust features, supposed to be related to formant regions, which can be used as extra features for traditional HMM recognizers to improve their performance. These issues are evaluated in the present paper, and different experimental results are reported on the Numbers95 database.

机译：本文介绍了一种新的HMM模型的理论基础和初步的实验结果。称为HMM2，可以视为HMM的混合物。在这个新模型中，通过在声学特征空间中工作的特定于状态的次要HMM估算了时间（主要）HMM的发射概率。因此，当主要HMM执行通常的时间扭曲和积分时，次要HMM负责在执行频率扭曲和积分的同时提取/建模可能的特征依赖性。这样的模型具有几个潜在的优点，例如对语音信号的时间/频率结构的更灵活的建模。当使用频谱特征时，这样的系统还可以执行非线性频谱扭曲，从而有效地实现非线性声道标准化的一种形式。此外，将显示HMM2可用于提取与共振峰区域有关的噪声鲁棒特征，这些特征可用作传统HMM识别器的额外特征，以改善其性能。本文对这些问题进行了评估，并在Numbers95数据库中报告了不同的实验结果。

著录项

来源
《Computer speech and language》 |2003年第3期|p.195-211|共17页
作者
Katrin Weber; Shajith Ikbal; Samy Bengio; Herve Bourlard;
展开▼
作者单位

IDIAP ― Dalle Molle Institute for Perceptual Artificial Intelligence, Rue du Simplon 4, Case Postale 592, 1920 Martigny, Switzerland;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Speech Features Extraction Techniques for Robust Emotional Speech Analysis/Recognition [J] . K. M. Shiva Prasad, G. N. Kodanda Ramaiah, M. B. Manjunatha Indian Journal of Science and Technology . 2017,第3期

机译：语音特征提取技术，用于健壮的情感语音分析/识别
2. Combining speech enhancement and auditory feature extraction for robust speech recognition [J] . Michael Kleinschmidt, Jurgen Tchorz, Birger Kollmeier Speech Communication . 2001,第1a2期

机译：结合语音增强和听觉特征提取以实现强大的语音识别
3. Noise robust speech recognition by integration of MLLR adaptation and feature extraction for noise reduced speech [J] . Masakiyo Fujimoto, Yasuo Ariki 電子情報通信学会技術研究報告. 音声. Speech . 2001,第522期

机译：通过集成MLLR自适应和特征提取以降低噪声的语音，增强了噪声鲁棒性
4. Increasing speech recognition robustness with HMM2 [C] . Weber, K., Bengio, . 2002

机译：使用HMM2提高语音识别的鲁棒性
5. Wavelet-based feature extraction for robust speech recognition. [D] . Walker, Shonda Lachelle. 2003

机译：基于小波的特征提取，可实现强大的语音识别。
6. A bio-inspired feature extraction for robust speech recognition [O] . Youssef Zouhir, Kaïs Ouni -1

机译：具有生物启发性的特征提取可实现强大的语音识别
7. Robust Speech Recognition and Feature Extraction Using HMM2 [O] . Katrin Weber, Shajith Ikbal, Samy Bengio, 2001

机译：基于Hmm2的鲁棒语音识别与特征提取

Robust speech recognition and feature extraction using HMM2

摘要

著录项

相似文献

相关主题

期刊订阅