Journal: The Journal of the Acoustical Society of America

The role of auditory spectro-temporal modulation filtering and the decision metric for speech intelligibility prediction


Abstract

Speech intelligibility models typically consist of a preprocessing part that transforms stimuli into some internal (auditory) representation and a decision metric that relates the internal representation to speech intelligibility. The present study analyzed the role of modulation filtering in the preprocessing of different speech intelligibility models by comparing predictions from models that either assume a spectro-temporal (i.e., two-dimensional) or a temporal-only (i.e., one-dimensional) modulation filterbank. Furthermore, the role of the decision metric for speech intelligibility was investigated by comparing predictions from models based on the signal-to-noise envelope power ratio, SNR_env, and the modulation transfer function, MTF. The models were evaluated in conditions of noisy speech (1) subjected to reverberation, (2) distorted by phase jitter, or (3) processed by noise reduction via spectral subtraction. The results suggested that a decision metric based on the SNR_env may provide a more general basis for predicting speech intelligibility than a metric based on the MTF. Moreover, the one-dimensional modulation filtering process was found to be sufficient to account for the data when combined with a measure of across-(audio-)frequency variability at the output of the auditory preprocessing. A complex spectro-temporal modulation filterbank might therefore not be required for speech intelligibility prediction.
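
To make the idea of a signal-to-noise envelope power ratio more concrete, the following is a minimal sketch and not the implementation used in the study. It assumes that envelope power is taken as the normalized AC power of an envelope (variance divided by squared mean) and that the speech envelope power is estimated as the noisy-speech envelope power minus the noise envelope power. The function names `envelope_power` and `snr_env`, the `floor` parameter, and the synthetic 4 Hz envelopes are hypothetical, chosen only for illustration. The auditory filterbank, the modulation filterbank, and the integration across channels are all omitted.

```python
import math


def envelope_power(envelope):
    """Normalized AC power of an envelope: variance divided by squared mean."""
    n = len(envelope)
    mean = sum(envelope) / n
    if mean == 0:
        raise ValueError("envelope mean must be non-zero")
    variance = sum((x - mean) ** 2 for x in envelope) / n
    return variance / (mean ** 2)


def snr_env(mix_envelope, noise_envelope, floor=1e-6):
    """Envelope-power SNR for one channel (illustrative sketch).

    `floor` is a hypothetical safeguard against division by zero and
    negative speech-power estimates; its value is an assumption.
    """
    p_mix = envelope_power(mix_envelope)
    p_noise = max(envelope_power(noise_envelope), floor)
    # Speech envelope power estimated as noisy-speech minus noise power.
    p_speech = max(p_mix - p_noise, floor)
    return p_speech / p_noise


# Synthetic envelopes sampled at 100 Hz for 1 s, both modulated at 4 Hz:
# the noisy speech has a modulation depth of 0.5, the noise alone 0.1.
fs = 100
t = [k / fs for k in range(fs)]
mix = [1.0 + 0.5 * math.sin(2 * math.pi * 4 * x) for x in t]
noise = [1.0 + 0.1 * math.sin(2 * math.pi * 4 * x) for x in t]

ratio = snr_env(mix, noise)
ratio_db = 10 * math.log10(ratio)
print(f"SNR_env = {ratio:.2f} ({ratio_db:.1f} dB)")
```

For these synthetic envelopes the envelope powers are about 0.125 and 0.005, so the ratio comes out close to 24 (roughly 13.8 dB). A deeper modulation in the noisy speech relative to the noise alone yields a larger value, which is the quantity a decision metric of this kind would then map to intelligibility.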
