Speech energy redistribution for intelligibility improvement in noise based on a perceptual distortion measure

Cees H. Taal; Richard C. Hendriks; Richard Heusdens

Abstract

A speech pre-processing algorithm is presented that improves the speech intelligibility in noise for the near-end listener. The algorithm improves intelligibility by optimally redistributing the speech energy over time and frequency according to a perceptual distortion measure, which is based on a spectro-temporal auditory model. Since this auditory model takes into account short-time information, transients will receive more amplification than stationary vowels, which has been shown to be beneficial for intelligibility of speech in noise. The proposed method is compared to unprocessed speech and two reference methods using an intelligibility listening test. Results show that the proposed method leads to significant intelligibility gains while still preserving quality. Although one of the methods used as a reference obtained higher intelligibility gains, this happened at the cost of decreased quality. Matlab code is provided.
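
The idea of redistributing energy over time and frequency can be illustrated with a small sketch. This is not the authors' optimizer and does not implement their perceptual distortion measure or auditory model. It is a hypothetical stand-in that uses a simple local-SNR heuristic to pick per-cell gains, and it assumes that "redistribution" means the total speech energy stays fixed. The function name `redistribute_energy`, the `exponent` parameter, and the input layout (bands by frames) are all assumptions made for illustration.

```python
import numpy as np


def redistribute_energy(speech_tf, noise_tf, exponent=0.5):
    """Toy time-frequency energy redistribution (illustrative stand-in only).

    speech_tf, noise_tf: non-negative arrays of shape (bands, frames) holding
    the energy per time-frequency cell. Returns (gains, processed_speech_tf).
    """
    speech_tf = np.asarray(speech_tf, dtype=float)
    noise_tf = np.asarray(noise_tf, dtype=float)
    eps = 1e-12  # guards against division by zero in silent cells

    # Heuristic (assumption): cells with a low local SNR get a larger weight.
    weights = (noise_tf / (speech_tf + eps)) ** exponent

    # Rescale so the processed speech has the same total energy as the input.
    boosted = speech_tf * weights
    scale = speech_tf.sum() / (boosted.sum() + eps)
    gains = weights * scale
    return gains, speech_tf * gains


if __name__ == "__main__":
    speech = np.array([[4.0, 1.0], [1.0, 4.0]])
    noise = np.ones_like(speech)
    gains, processed = redistribute_energy(speech, noise)
    print("gains:\n", gains)
    print("total energy before/after:", speech.sum(), processed.sum())
```

The final rescaling step is what makes this a redistribution rather than a plain amplification: energy added to some cells is taken from others.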

Bibliographic details

Source: Computer Speech and Language, 2014, No. 4, pp. 858-872 (15 pages)
Authors: Cees H. Taal; Richard C. Hendriks; Richard Heusdens
Affiliations:

Leiden University Medical Center, ENT-Department, 2300 RC Leiden, The Netherlands;

Delft University of Technology, Signal Information & Processing Lab, Mekelweg 4, 2628 CD Delft, The Netherlands;

Delft University of Technology, Signal Information & Processing Lab, Mekelweg 4, 2628 CD Delft, The Netherlands;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification: (none listed)
Keywords: Near-end speech enhancement; Intelligibility improvement; Transients

Similar literature

1. Relations between perceptual measures of temporal processing, auditory-evoked brainstem responses and speech intelligibility in noise [J]. Papakonstantinou A, Strelcyk O, Dau T. Hearing Research: An International Journal, 2011, No. 1-2.

2. Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time–Frequency Energy Reallocation Approach [J]. Tudor-Cătălin Zorilă, Yannis Stylianou, Tatsuma Ishihara. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016, No. 10.

3. Combining perceptually-motivated spectral shaping with loudness and duration modification for intelligibility enhancement of HMM-based synthetic speech in noise [J]. Cassia Valentini-Botinhao, Junichi Yamagishi, Simon King. 電子情報通信学会技術研究報告. 音声. Speech, 2013, No. 76.

4. A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure [C]. Taal, Cees H. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012.

5. Multisensor segmentation-based noise suppression for intelligibility improvement in MELP coders [D]. Demiroglu, Cenk. 2006.

6. Analysis of a simplified normalized covariance measure based on binary weighting functions for predicting the intelligibility of noise-suppressed speech [O]. Fei Chen, Philipos C. Loizou.

7. A speech preprocessing strategy for intelligibility improvement in noise based on a perceptual distortion measure [O]. Cees H. Taal, Richard C. Hendriks, Richard Heusdens. 2012.

8. Improvement of Speech Intelligibility in Noise. Development and Evaluation of a New Directional Hearing Instrument Based on Array Technology [R]. Soede, W. 1990.
