Joint sparse representation based cepstral-domain dereverberation for distant-talking speech recognition

机译：基于联合稀疏表示的倒谱域去混响用于远距离语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we address reducing the mismatch between training and testing conditions for robust distant-talking speech recognition under realistic reverberant environments. It is well known that the distortions caused by reverberation, background noise, etc., are highly nonlinear in the cepstral domain. In this paper we propose to capture the complex relationships between clean and reverberant speech via joint dictionary learning. Given a test reverberant speech with a sequence of feature vectors we first find their sparse representations, and then estimate the underlying clean feature vectors using the dictionary of clean speech. Based on speech recognition experiments conducted under realistic reverberation conditions, the proposed method is shown to perform very well, resulting in an average relative improvement of 59.1% compared with the baseline front-ends.

机译：在本文中，我们解决了在真实混响环境下减少鲁棒的远距离语音识别的训练条件与测试条件之间的不匹配问题。众所周知，由混响，背景噪声等引起的失真在倒频谱域中是高度非线性的。在本文中，我们建议通过联合词典学习来捕获干净语音和混响语音之间的复杂关系。给定具有一系列特征向量的测试混响语音，我们首先找到它们的稀疏表示，然后使用纯净语音字典估计基本的纯净特征矢量。基于在真实混响条件下进行的语音识别实验，所提出的方法表现出很好的效果，与基准前端相比平均可提高59.1％。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2013年|7117-7120|共4页
会议地点
作者
Li Weifeng; Wang Longbiao; Zhou Fei; Liao Qingmin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Mel-Frequency Cepstral Coefficients (MFCCs); blind dereverberation; reverberation-robust speech recognition; sparse representation;

机译：Mel频率倒谱系数（MFCC）;盲去混响;混响鲁棒语音识别;稀疏表示;

相似文献

外文文献
中文文献
专利

1. Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition [J] . Ren Bo, Wang Longbiao, Lu Liang, Multimedia Tools and Applications . 2016,第9期

机译：瓶颈特征提取与去混响相结合，用于远距离语音识别
2. Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization [J] . Ueda Yuma, Wang Longbiao, Kai Atsuhiko, Journal of signal processing systems for signal, image, and video technology . 2016,第2期

机译：结合去噪自动编码器和时间结构归一化的单通道去混响用于远距离语音识别
3. Face Recognition Based on Sparse Representation and Joint Sparsity Model with Matrix Completion [J] . Inaba Fernando Kentaro, Salles Evandro Ottoni Teatini Latin America Transactions, IEEE (Revista IEEE America Latina) . 2012,第1期

机译：基于稀疏表示和矩阵稀疏联合稀疏模型的人脸识别
4. JOINT SPARSE REPRESENTATION BASED CEPSTRAL-DOMAIN DEREVERBERATION FOR DISTANT-TALKING SPEECH RECOGNITION [C] . Weifeng Li, Longbiao Wang, Fei Zhou, International Conference on Acoustics, Speech and Signal Processing . 2013

机译：基于联合稀疏表示的抗痉挛域DERERATERATION用于遥远的语音识别
5. Sparse and Low-Rank Representation-Based Methods for Multimodal Clustering and Recognition [D] . Abavisani, Mahdi. 2021

机译：基于稀疏和低秩的多模式聚类和识别的方法
6. Image Target Recognition via Mixed Feature-Based Joint Sparse Representation [O] . Xin Wang, Can Tang, Ji Li, 2020

机译：通过混合特征的关节稀疏表示图像目标识别
7. MODEL-BASED DEREVERBERATION IN THE LOGMELSPEC DOMAIN FOR ROBUST DISTANT-TALKING SPEECH RECOGNITION [O] . Armin Sehr, Walter Kellermann 2011

机译：LOGMELSPEC域中基于模型的去耦，用于鲁棒远程语音识别
8. Joint Sparse Representation for Robust Multimodal Biometrics Recognition. [R] . Patel, V. M., Nasrabadi, N. M., Chellappa, R., 2014

机译：鲁棒多模态生物特征识别的联合稀疏表示。

Joint sparse representation based cepstral-domain dereverberation for distant-talking speech recognition

摘要

著录项

相似文献

相关主题

期刊订阅