首页> 美国政府科技报告 >Towards Co-Channel Speaker Separation by 2-D Demodulation of Spectrograms

【24h】

Towards Co-Channel Speaker Separation by 2-D Demodulation of Spectrograms

机译：通过频谱图的二维解调实现共通道扬声器分离

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper explores a two-dimensional (2-D) processing approach for co-channel speaker separation of voiced speech. We analyze localized time- frequency regions of a narrowband spectrogram using 2-D Fourier transforms and propose a 2-D amplitude modulation model based on pitch information for single and multi-speaker content in each region. Our model maps harmonically-related speech content to concentrated entities in a transformed 2-D space, thereby motivating 2-D demodulation of the spectrogram for analysis/synthesis and speaker separation. Using a priori pitch estimates of individual speakers, we show through a quantitative evaluation: 1) Utility of the model for representing speech content of a single speaker and 2) Its feasibility for speaker separation. For the separation task, we also illustrate benefits of the model's representation of pitch dynamics relative to a sinusoidal-based separation system.

著录项

作者
Wang, T. T.; Quatieri, T. F.;
展开▼
作者单位

展开▼
年度 2009
页码 1-5
总页数 5
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Speech analysis ; Two dimensional ; Spectrography ; Workshops ; Fourier transformation ; Separation;

机译：语音分析;二维;光谱学;研讨会;傅立叶变换;分离;

相似文献

外文文献
中文文献
专利

1. Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT on Row Mean of Spectrogram for Speaker Identification [J] . H. B. Kekre, Prachi J. Natu, Shachi J. Natu, International Journal of Biometric and Bioinformatics . 2010,第3期

机译：完整/分组频谱图上的二维DCT和频谱图行均值上的1-D DCT用于说话人识别的性能比较
2. SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM [J] . Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, International Journal on Computer Science and Engineering . 2010,第5期

机译：扬声器识别使用2-D DCT，WALSH和HAAR全部和块谱图
3. A Robust Spectral Correlation Technique for Text Dependent Speaker Identification under Co-Channel Multi-Speaker Conditions [J] . Aya S. Mostafa, Amr M. Gody, Tamer M. Barakat International Journal of Engineering Trends and Technology . 2016,第5期

机译：共通道多说话者条件下基于文本的说话人识别的鲁棒频谱相关技术
4. TOWARDS CO-CHANNEL SPEAKER SEPARATION BY 2-D DEMODULATION OF SPECTROGRAMS [C] . Tianyu T. Wang, Thomas F. Quatieri Workshop on Applications of Signal Processing to Audio and Acoustics . 2009

机译：朝向共信道扬声器分离，通过2-D解调谱图
5. Automatic Speaker Recognition and Diarization in Co-Channel Speech [D] . Shokouhi, Navid. 2017

机译：同频道语音中的说话人自动识别和区分
6. Long short-term memory for speaker generalization in supervised speech separation [O] . Jitong Chen, DeLiang Wang -1

机译：长时短时记忆用于监督语音分离中的说话人泛化
7. TOWARDS CO-CHANNEL SPEAKER SEPARATION BY 2-D DEMODULATION OF SPECTROGRAMS 1 [O] . Tianyu T. Wang 2013

机译：通过二维解码光谱分离共通道扬声器1

Towards Co-Channel Speaker Separation by 2-D Demodulation of Spectrograms

摘要

著录项

相似文献

相关主题

期刊订阅