A new feature set for masking-based monaural speech separation

机译：用于基于掩蔽的单声道语音分离的新功能

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose a new feature based on a gammatone filter bank for improving monaural speech separation using neural networks. This new feature encodes not only the local information of cochleagram, and spectrotemporal context, similar to previous approaches, but also captures time-frequency dynamics in the spectrotemporal context using an image processing technique. Speech separation was achieved by computing optimal time-frequency masks using two types of neural networks (DNN and LSTM) to determine the interactions between feature and training model properties. The performance of our feature was evaluated in a variety of simulated environments having different non-stationary noises and reverberation times and quantified using three objective measures. Experimental results show that the proposed monaural feature set improves the objective speech intelligibility, speech quality and signal-to-noise ratio compared to prior feature sets in noisy and reverberant environments with particular benefit in speech intelligibility.

机译：我们提出了一种基于伽马托滤波器的新功能，用于使用神经网络改善单声道语音分离。该新功能不仅可以编码Cochleagram的本地信息，以及类似于先前的方法，而且使用图像处理技术捕获光谱仪器上下文中的时间频率动态。通过使用两种类型的神经网络（DNN和LSTM）计算最佳时频掩模来实现语音分离，以确定特征和训练模型属性之间的交互。我们特征的性能被评估在各种模拟环境中，具有不同的非平稳噪声和混响时间，并使用三个客观措施量化。实验结果表明，与嘈杂和混响环境中的先前特征集相比，所提出的单声道特征组可提高目标语音可懂度，语音质量和信噪比，特别是语音可懂度特别益处。

著录项

来源
《Asilomar Conference on Signals, Systems, and Computers》|2018年|xxxii 746 p. :|共5页
会议地点
作者
Shadi Pirhosseinloo; Jonathan S. Brumberg;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类总体结构、系统结构;
关键词
Training; Noise measurement; Neural networks; Speech processing; Time-frequency analysis; Reverberation; Feature extraction;

机译：训练;噪声测量;神经网络;语音处理;时频分析;混响;特征提取;

相似文献

外文文献
中文文献
专利

1. Features for Masking-Based Monaural Speech Separation in Reverberant Conditions [J] . Masood Delfarah, DeLiang Wang Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第5期

机译：混响条件下基于蒙版的单声道语音分离的功能
2. Monaural speech separation based on MAXVQ and CASA for robust speech recognition [J] . Peng Li, Yong Guan, Shijin Wang, Computer speech and language . 2010,第1期

机译：基于MAXVQ和CASA的单声道语音分离可增强语音识别能力
3. Monaural Speech Separation Based on Computational Auditory Scene Analysis and Objective Quality Assessment of Speech [J] . Li P., Guan Y., Xu B., IEEE transactions on audio, speech and language processing . 2006,第6期

机译：基于计算听觉场景分析和语音客观质量评估的单声道语音分离
4. A new feature set for masking-based monaural speech separation [C] . Shadi Pirhosseinloo, Jonathan S. Brumberg 2018 52nd Asilomar Conference on Signals, Systems, and Computers . 2018

机译：基于蒙版的单声道语音分离的新功能集
5. Kinematic measurement and feature sets for automatic speech recognition. [D] . Fain, Daniel Clark. 2001

机译：运动学测量和功能集，用于自动语音识别。
6. Complex Ratio Masking for Monaural Speech Separation [O] . Donald S. Williamson, Yuxuan Wang, DeLiang Wang -1

机译：用于单声道语音分离的复数比率掩蔽
7. NMF based speech and music separation in monaural speech recordings with sparseness and temporal continuity constraints [O] . Tu Ming, Xie Xiang, Jiao Yishan 2013

机译：基于NMF的语音和音乐分离在单声道语音记录中，具有稀疏性和时间连续性约束
8. Deep Ensemble Learning for Monaural Speech Separation. [R] . Wang, D. 2015

机译：单声道语音分离的深度集成学习。

A new feature set for masking-based monaural speech separation

摘要

著录项

相似文献

相关主题

期刊订阅