The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction

Alexandre Chabot-Leclerc; S?ren J?rgensen; Torsten Dau

首页> 外文期刊>The Journal of the Acoustical Society of America >The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction

【24h】

The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction

机译：听觉时空调制滤波的作用和语音清晰度预测的决策指标

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speech intelligibility models typically consist of a preprocessing part that transforms stimuli into some internal (auditory) representation and a decision metric that relates the internal representation to speech intelligibility. The present study analyzed the role of modulation filtering in the preprocessing of different speech intelligibility models by comparing predictions from models that either assume a spectro-temporal (i.e., two-dimensional) or a temporal-only (i.e., one-dimensional) modulation filterbank. Furthermore, the role of the decision metric for speech intelligibility was investigated by comparing predictions from models based on the signal-to-noise envelope power ratio, SNR_(env), and the modulation transfer function, MTF. The models were evaluated in conditions of noisy speech (1) subjected to reverberation, (2) distorted by phase jitter, or (3) processed by noise reduction via spectral subtraction. The results suggested that a decision metric based on the SNR_(env) may provide a more general basis for predicting speech intelligibility than a metric based on the MTF. Moreover, the one-dimensional modulation filtering process was found to be sufficient to account for the data when combined with a measure of across (audio) frequency variability at the output of the auditory preprocessing. A complex spectro-temporal modulation filterbank might therefore not be required for speech intelligibility prediction.

机译：语音清晰度模型通常由将刺激转换为某种内部（听觉）表示的预处理部分和将内部表示与语音清晰度相关的决策度量组成。本研究通过比较模型的预测来分析调制滤波在不同语音清晰度模型的预处理中的作用，这些模型假设是时变的（即二维）或仅时变的（即一维）调制滤波器组。此外，通过比较基于信噪包络功率比SNR_（env）和调制传递函数MTF的模型的预测，研究了决策度量在语音清晰度方面的作用。在嘈杂语音条件下（1）进行混响，（2）由于相位抖动而失真，或（3）通过频谱减法降噪处理，对模型进行了评估。结果表明，与基于MTF的度量相比，基于SNR_（env）的决策度量可以为预测语音清晰度提供更通用的基础。此外，发现一维调制滤波过程足以与听觉预处理输出处的跨（音频）频率可变性度量结合使用来说明数据。因此，语音清晰度预测可能不需要复杂的频谱时间调制滤波器组。

著录项

来源
《The Journal of the Acoustical Society of America》 |2014年第6期|共11页
作者
Alexandre Chabot-Leclerc; S?ren J?rgensen; Torsten Dau;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
prediction; conditions; results;

机译：预测;条件;结果;

相似文献

外文文献
中文文献
专利

1. The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction [J] . Alexandre Chabot-Leclerc, S?ren J?rgensen, Torsten Dau The Journal of the Acoustical Society of America . 2014,第6期

机译：听觉时空调制滤波的作用和语音清晰度预测的决策指标
2. Auditory motivated front-end for noisy speech using spectro-temporal modulation filtering [J] . Ganapathy Sriram, Omar Mohamed The Journal of the Acoustical Society of America . 2014,第5期

机译：使用频谱时间调制滤波的听觉激励式前端，用于嘈杂的语音
3. Speech intelligibility prediction with the dynamic compressive gammachirp filterbank and modulation power spectrum [J] . Katsuhiko Yamamoto, Toshio Irino, Toshie Matsui, Acoustical science and technology . 2019,第2期

机译：用动态压缩Gammachirp滤波器和调制功率谱预测语音可懂语预测
4. Automatic intelligibility assessment of pathologic speech in head and neck cancer based on auditory-inspired spectro-temporal modulations [C] . Xinhui Zhou, Daniel Garcia-Romero, Nima Mesgarani, Annual conference of the International Speech Communication Association . 2012

机译：基于听觉启发的光谱时空调制的头颈癌病理性言语自动清晰度评估
5. Investigation of in-vehicle speech intelligibility metrics for normal hearing and hearing impaired listeners. [D] . Samardzic, Nikolina. 2013

机译：针对正常听觉和听力障碍听众的车载语音清晰度指标的调查。
6. Predictions of Speech Chimaera Intelligibility Using Auditory Nerve Mean-Rate and Spike-Timing Neural Cues [O] . Michael R. Wirtzfeld, Rasha A. Ibrahim, Ian C. Bruce 2017

机译：使用听觉神经平均速率和加标定时神经线索预测言语Chimaera清晰度
7. Speech intelligibility prediction with the dynamic compressive gammachirp filterbank and modulation power spectrum [O] . Katsuhiko Yamamoto, Toshio Irino, Toshie Matsui, 2019

机译：用动态压缩Gammachirp滤波器和调制功率谱预测语音可懂语预测
8. Spectro-Temporal Modulation Transfer Functions and Speech Intelligibility. [R] . Chi, T., Gao, Y., Guyton, M. C., 1999

机译：分光 - 时间调制传递函数和语音清晰度。

The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction

摘要

著录项

相似文献

相关主题

期刊订阅