Cochannel Speech Segregation with Sparse Coding

Abstract

Most computational auditory scene analysis (CASA) systems rely on pitch-based features. Cochannel speech segregation involves two speakers, and the pitch ranges of male and female speech overlap to a large extent, so multi-pitch tracking is a nontrivial task. For same-gender mixtures, pitch tracking becomes harder still. More reliable features are therefore needed. We propose a cochannel speech segregation system with sparsity-based features. Sparse coding is applied to the cochleagram of the signal, using pre-trained speaker-specific dictionaries, to obtain sparse approximation coefficients. We treat these coefficients as the features because they are selected from the speaker-specific dictionaries to represent the input signal, which makes them a good choice for estimating binary masks. The speech waveform is resynthesized from the masked cochleagram of the mixture. Experimental results show that the proposed method produces better objective intelligibility scores than the baseline system.
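The abstract does not specify the sparse solver or the rule that turns coefficients into a mask, so the following is only a minimal sketch under stated assumptions. It assumes a greedy orthogonal matching pursuit solver (the keywords mention matching pursuit algorithms), a concatenated dictionary built from two speaker dictionaries, and a mask that marks a time-frequency unit for speaker 1 when that speaker's reconstruction is larger. The names `omp`, `binary_mask`, `D1`, `D2`, and `n_nonzero` are hypothetical and do not come from the paper.

```python
import numpy as np

def omp(D, x, n_nonzero):
    """Greedy orthogonal matching pursuit (assumed solver, not confirmed by the paper).

    Approximates x with at most n_nonzero columns (atoms) of D and
    returns the full-length sparse coefficient vector.
    """
    residual = x.astype(float).copy()
    support = []
    coef = np.zeros(D.shape[1])
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Select the atom most correlated with the current residual.
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break
        support.append(k)
        # Re-fit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
        residual = x - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    if support:
        coef[support] = sol
    return coef

def binary_mask(mixture, D1, D2, n_nonzero=4):
    """Estimate a binary mask for speaker 1 from a cochleagram-like matrix.

    mixture: array of shape (channels, frames).
    D1, D2:  speaker dictionaries of shape (channels, atoms); assumed pre-trained.
    The comparison rule below is an illustrative assumption.
    """
    D = np.hstack([D1, D2])      # concatenated speaker dictionaries
    n1 = D1.shape[1]
    mask = np.zeros(mixture.shape, dtype=bool)
    for t in range(mixture.shape[1]):
        c = omp(D, mixture[:, t], n_nonzero)
        est1 = np.abs(D1 @ c[:n1])   # part explained by speaker 1 atoms
        est2 = np.abs(D2 @ c[n1:])   # part explained by speaker 2 atoms
        mask[:, t] = est1 > est2
    return mask
```

Multiplying the mixture cochleagram elementwise by the returned mask would give the masked cochleagram from which the abstract says the waveform is resynthesized; the resynthesis step itself is not sketched here.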

Bibliographic details

Source: International Conference on Electrical, Electronics, and Optimization Techniques, 2016, pp. 4589-4592 (4 pages)
Conference location: Chennai (IN)
Authors: Pallavi P. Ingale; S. L. Nalbalwar
Author affiliation (both authors): Department of Electronics and Telecommunication Engineering, Dr. Babasaheb Ambedkar Technological University, Lonere, India

Original format: PDF
Language: English
Keywords: Speech; Dictionaries; Matching pursuit algorithms; Speech coding; Speech enhancement; Feature extraction


Similar literature

1. Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus [J]. Nicole L. Carlson, Vivienne L. Ming, Michael Robert DeWeese. PLoS Computational Biology. 2012, No. 7
2. Speech Segregation Using an Auditory Vocoder With Event-Synchronous Enhancements [J]. Irino T., Patterson R.D., Kawahara H. IEEE Transactions on Audio, Speech and Language Processing. 2006, No. 6
3. Speech segregation based on fundamental periodicity using auditory vocoder [J]. Toshio Irino, Roy D. Patterson, Hideki Kawahara. 電子情報通信学会技術研究報告. 音声. Speech. 2003, No. 155
4. Cochannel Speech Segregation with Sparse Coding [C]. Pallavi P. Ingale, S. L. Nalbalwar. International Conference on Electrical, Electronics, and Optimization Techniques. 2016
5. Sparse Coding of Speech Data Predicts Properties of the Early Auditory System [D]. Carlson, Nicole Liu. 2012
6. Sparse Codes for Speech Predict Spectrotemporal Receptive Fields in the Inferior Colliculus [O]. Nicole L. Carlson, Vivienne L. Ming, Michael Robert DeWeese. 2012
7. Retrieving Sparse Patterns Using a Compressed Sensing Framework: Applications to Speech Coding Based on Sparse Linear Prediction [O]. Giacobello, Daniele; Christensen, Mads Græsbøll; Murthi, Manohar. 2010
8. Joint Space-Time Coded Modulation and Channel Coding over Fading Channels with Cochannel Interference [R]. Haimovich, A. M.; Lao, D. 2003
