Recognizing Cochlear Implant-like Spectrally Reduced Speech with HMM-based ASR: Experiments with MFCCs and PLP Coefficients

机译：基于HMM的ASR识别人工耳蜗般的频谱减少语音：MFCC和PLP系数的实验

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate the recognition of cochlear implant-like spectrally reduced speech (SRS) using conventional speech features (MFCCs and PLP coefficients) and HMM-based ASR. The SRS was synthesized from subband temporal envelopes extracted from original clean speech for testing, whereas the acoustic models were trained on a different set of original clean speech signals of the same speech database. It was shown that changing the bandwidth of the subband temporal envelopes had no significant effect on the ASR word accuracy. In addition, increasing the number of frequency subbands of the SRS from 4 to 16 improved significantly the system performance. Furthermore, the ASR word accuracy attained with the original clean speech, by using both MFCC-based and PLP-based speech features, can be achieved by using the 16-, 24-, or 32-subband SRS. The experiments were carried out by using the Tl-digits speech database and the HTK speech recognition toolkit.

机译：在本文中，我们研究了使用常规语音特征（MFCC和PLP系数）和基于HMM的ASR对耳蜗状植入式频谱缩减语音（SRS）的识别。 SRS是从原始原始语音中提取的子带时间包络合成的，用于测试，而声学模型是在同一语音数据库的另一组原始原始语音信号上进行训练的。结果表明，改变子带时域包络的带宽对ASR字精度没有明显影响。此外，将SRS的子频带数量从4个增加到16个，可以显着改善系统性能。此外，通过同时使用基于MFCC和基于PLP的语音功能，可以通过使用16、24或32子带SRS来获得原始原始语音所获得的ASR字精度。通过使用T1位数语音数据库和HTK语音识别工具包进行了实验。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2642-2645|共4页
会议地点
作者
Cong-Thanh Do; Dominique Pastor; Gaeel Le Lan; Andre Goalic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
spectrally reduced speech; subband temporal en- velope; cochlear implant; HMM-based ASR; MFCCs; PLP co- efficients;

机译：频谱减少的语音子带时间包络;人工耳蜗基于HMM的ASR; MFCC; PLP系数;

相似文献

外文文献
中文文献
专利

1. On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR [J] . Do C.-T., Pastor D., Goalic A. Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第5期

机译：MFCC和基于HMM的ASR识别像人工耳蜗般的频谱减少语音
2. On Normalized MSE Analysis of Speech Fundamental Frequency in the Cochlear Implant-Like Spectrally Reduced Speech [J] . Do C.-T., Pastor D., Goalic A. Biomedical Engineering, IEEE Transactions on . 2010,第3期

机译：人工耳蜗样频谱减少语音中语音基本频率的归一化MSE分析
3. Robust Speech Recognition System Using Conventional and Hybrid Features of MFCC, LPCC, PLP, RASTA-PLP and Hidden Markov Model Classifier in Noisy Conditions [J] . Veton Z. K?puska, Hussien A. Elharati Journal of Computer and Communications . 2015,第6期

机译：噪声条件下使用MFCC，LPCC，PLP，RASTA-PLP和隐马尔可夫模型分类器的常规和混合特征的鲁棒语音识别系统
4. Recognizing Cochlear Implant-like Spectrally Reduced Speech with HMM-based ASR: Experiments with MFCCs and PLP Coefficients [C] . Cong-Thanh Do, Dominique Pastor, Gaeel Le Lan, Annual conference of the International Speech Communication Association . 2010

机译：识别基于HMM的ASR的触控式植入谱减小语音：MFCC和PLP系数的实验
5. Lexical tone development, music perception and speech perception in noise with cochlear implants: The effects of spectral resolution and spectral mismatch. [D] . Zhou, Ning. 2010

机译：人工耳蜗中噪声中的词汇音调发展，音乐感知和语音感知：频谱分辨率和频谱失配的影响。
6. Transfer of Auditory Perceptual Learning with Spectrally Reduced Speech to Speech and Nonspeech Tasks: Implications for Cochlear Implants [O] . Jeremy L. Loebach, David B. Pisoni, Mario A. Svirsky -1

机译：对听觉感知学习的转移与言语和非静音任务的频谱减少：对耳蜗植入的影响
7. On the recognition of cochlear implant-like spectrally reduced speech with MFCC and HMM-based ASR [O] . Cong-thanh Do, Dominique Pastor, Gaël Le Lan, 2010

机译：利用mFCC和基于Hmm的asR识别人工耳蜗类光谱减少语音

Recognizing Cochlear Implant-like Spectrally Reduced Speech with HMM-based ASR: Experiments with MFCCs and PLP Coefficients

摘要

著录项

相似文献

相关主题

期刊订阅