Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement

机译：通过Itakura-Saito非负RPCA进行掩码估计以增强语音

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Mask estimate is regarded as the main goal for using the computational auditory scene analysis method to enhance speech contaminated by noises. This paper presents extended robust principal component analysis (RPCA) methods, referred to as NRPCA and ISNRPCA, to estimate mask effectively. The perceptually motivated cochleagram is decomposed into sparse and low-rank components via NRPCA or ISNRPCA, which correspond to speech and noises, respectively. Different from the classical RPCA, NRPCA imposes nonnegative constraints to regularize the decomposed components. Furthermore, ISNRPCA uses the perceptually meaningful Itakura-Saito measure as its optimization objective function. We use the alternating direction method of multipliers to solve the corresponding optimization problem. NRPCA and ISNRPCA are totally unsupervised, neither speech nor noise model needs to be trained beforehand. Experimental results demonstrate that NRPCA and ISNRPCA show promising results for speech enhancement. With respect to state of the art baselines, the proposed methods achieve better performance on noises suppression and demonstrate at least comparable intelligibility and overall-quality.

机译：掩模估计被认为是使用计算听觉场景分析方法来增强被噪声污染的语音的主要目标。本文提出了扩展的鲁棒主成分分析（RPCA）方法，分别称为NRPCA和ISNRPCA，以有效地估计掩码。通过NRPCA或ISNRPCA将感知动机的耳蜗图分解为稀疏分量和低秩分量，分别对应于语音和噪声。与经典的RPCA不同，NRPCA施加非负约束来规范分解后的组件。此外，ISNRPCA使用感知上有意义的Itakura-Saito度量作为其优化目标函数。我们使用乘法器的交替方向方法来解决相应的优化问题。 NRPCA和ISNRPCA完全不受监督，不需要预先训练语音或噪声模型。实验结果表明，NRPCA和ISNRPCA在语音增强方面显示出令人鼓舞的结果。关于现有技术的基准，所提出的方法在抑制噪声方面取得了更好的性能，并证明了至少可比的清晰度和整体质量。

著录项

来源
《2016 IEEE International Workshop on Acoustic Signal Enhancement》|2016年|1-5|共5页
会议地点 Xian(CH)
作者
Gang Min; Xiongwei Zhang; Xia Zou; Meng Sun;
展开▼
作者单位

Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China;

Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China;

Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China;

Lab of Intelligent Information Processing, PLA University of Science and Technology, Nanjing, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Speech; Speech enhancement; Linear programming; Optimization; Spectrogram; Principal component analysis; Noise measurement;

机译：语音;语音增强;线性编程;优化;频谱图;主成分分析;噪声测量;

相似文献

外文文献
中文文献
专利

1. Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation [J] . Houguang LIU, Wenbo WANG, Lin XUE, Archives of acoustics . 2020,第4期

机译：基于离散小波包变换和Itakura-Saito非负矩阵分子的语音增强
2. Supervised Single Channel Speech Enhancement Based on Dual-Tree Complex Wavelet Transforms and Nonnegative Matrix Factorization Using the Joint Learning Process and Subband Smooth Ratio Mask [J] . Md Shohidul Islam, Tarek Hasan Al Mahmud, Wasim Ullah Khan, Electronics . 2019,第3期

机译：基于双树复小波变换和非负矩阵分解的联合学习过程和子带平滑率掩码的监督单通道语音增强
3. Enhancement of speech signal denoising based on MFCC and Robust Principal Component Analysis RPCA [J] . Sonia Moussa, Zied Hajaiej, Ali Garsallah International journal of computer science and network security . 2019,第3期

机译：基于MFCC和鲁棒主成分分析RPCA的语音信号去噪增强。
4. Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement [C] . Gang Min, Xiongwei Zhang, Xia Zou, IEEE International Workshop on Acoustic Signal Enhancement . 2016

机译：通过Itakura-Saito非负RPCA进行展示估计的语言增强
5. Speech enhancement algorithms using Kalman filtering and masking properties of human auditory systems. [D] . Ma, Ning. 2005

机译：使用卡尔曼滤波和人类听觉系统掩蔽属性的语音增强算法。
6. Estimating nonnegative matrix model activations with deep neural networks to increase perceptual speech quality [O] . Donald S. Williamson, Yuxuan Wang, DeLiang Wang -1

机译：使用深度神经网络估计非负矩阵模型的激活以提高感知语音质量
7. FEATURE ENHANCEMENT USING SPARSE REFERENCE AND ESTIMATED SOFT-MASK EXEMPLAR-PAIRS FOR NOISY SPEECH RECOGNITION [O] . Lee Ngee, Tan Abeer Alwan 2015

机译：使用稀疏参考和估计的软掩体示例对进行噪声识别的功能增强

Mask estimate through Itakura-Saito nonnegative RPCA for speech enhancement

摘要

著录项

相似文献

相关主题

期刊订阅