Published in: The Journal of the Acoustical Society of America

Comparison of a short-time speech-based intelligibility metric to the speech transmission index and intelligibility data

Abstract

Several algorithms have been shown to generate a metric corresponding to the Speech Transmission Index (STI) using speech as a probe stimulus [e.g., Goldsworthy and Greenberg, J. Acoust. Soc. Am. 116, 3679–3689 (2004)]. The time-domain approaches work well on long speech segments and have the added potential to be used for short-time analysis. This study investigates the performance of the Envelope Regression (ER) time-domain STI method as a function of window length, in acoustically degraded environments with multiple talkers and speaking styles. The ER method is compared with a short-time Theoretical STI, derived from octave-band signal-to-noise ratios and reverberation times. For windows as short as 0.3 s, the ER method tracks short-time Theoretical STI changes in stationary speech-shaped noise, fluctuating restaurant babble, and stationary noise plus reverberation. The metric is also compared to intelligibility scores on conversational speech and on speech articulated clearly but at normal speaking rates (Clear/Norm) in stationary noise. Correlation between the metric and intelligibility scores is high and, consistent with the subject scores, the metrics are higher for Clear/Norm speech than for conversational speech and higher for the first word in a sentence than for the last word.
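
As a rough illustration of how a Theoretical STI can be derived from octave-band signal-to-noise ratios and reverberation times, the sketch below uses the classical modulation-transfer-function formulation of the STI. It is not the paper's implementation: the function names, the uniform octave-band weights, and the omission of masking and redundancy corrections are assumptions made to keep the example short. A short-time variant would presumably call the same function once per analysis window with that window's band SNRs.

```python
import math

# 14 modulation frequencies in Hz at 1/3-octave spacing (0.63 to 12.5 Hz),
# as used in the classical STI formulation.
MOD_FREQS_HZ = [0.63, 0.8, 1.0, 1.25, 1.6, 2.0, 2.5,
                3.15, 4.0, 5.0, 6.3, 8.0, 10.0, 12.5]


def modulation_transfer(f_mod_hz, snr_db, rt60_s):
    """Modulation transfer value m(F) for one band and one modulation frequency."""
    # Reduction of modulation depth caused by reverberation.
    reverb = 1.0 / math.sqrt(1.0 + (2.0 * math.pi * f_mod_hz * rt60_s / 13.8) ** 2)
    # Reduction of modulation depth caused by stationary noise.
    noise = 1.0 / (1.0 + 10.0 ** (-snr_db / 10.0))
    return reverb * noise


def band_transmission_index(snr_db, rt60_s):
    """Average transmission index of one octave band over all modulation frequencies."""
    total = 0.0
    for f_mod in MOD_FREQS_HZ:
        m = modulation_transfer(f_mod, snr_db, rt60_s)
        # Keep m strictly inside (0, 1) so the logarithm below is defined.
        m = min(max(m, 1e-6), 1.0 - 1e-6)
        apparent_snr_db = 10.0 * math.log10(m / (1.0 - m))
        # Clip the apparent SNR to the +/-15 dB range, then map it to [0, 1].
        apparent_snr_db = min(max(apparent_snr_db, -15.0), 15.0)
        total += (apparent_snr_db + 15.0) / 30.0
    return total / len(MOD_FREQS_HZ)


def theoretical_sti(band_snrs_db, band_rt60s_s, weights=None):
    """Weighted mean of band transmission indices.

    weights defaults to uniform, which is an assumption of this sketch;
    standardized STI procedures use specific octave-band weights.
    """
    if len(band_snrs_db) != len(band_rt60s_s):
        raise ValueError("need one SNR and one reverberation time per band")
    if weights is None:
        weights = [1.0] * len(band_snrs_db)
    weighted = sum(
        w * band_transmission_index(snr, rt)
        for w, snr, rt in zip(weights, band_snrs_db, band_rt60s_s)
    )
    return weighted / sum(weights)
```

For example, `theoretical_sti([0.0] * 7, [0.0] * 7)` describes seven octave bands at 0 dB SNR with no reverberation, and adding a nonzero reverberation time to every band lowers the result.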
