首页> 外文期刊>Journal of the Association for Research in Otolaryngology: JARO >Predictions of Speech Chimaera Intelligibility Using Auditory Nerve Mean-Rate and Spike-Timing Neural Cues
【24h】

Predictions of Speech Chimaera Intelligibility Using Auditory Nerve Mean-Rate and Spike-Timing Neural Cues

机译:用听觉神经平均速率和穗定时神经线索预测智拉拉可识别

获取原文
获取原文并翻译 | 示例
           

摘要

Perceptual studies of speech intelligibility have shown that slow variations of acoustic envelope (ENV) in a small set of frequency bands provides adequate information for good perceptual performance in quiet, whereas acoustic temporal fine-structure (TFS) cues play a supporting role in background noise. However, the implications for neural coding are prone to misinterpretation because the mean-rate neural representation can contain recovered ENV cues from cochlear filtering of TFS. We investigated ENV recovery and spike-time TFS coding using objective measures of simulated mean-rate and spike-timing neural representations of chimaeric speech, in which either the ENV or the TFS is replaced by another signal. We (a) evaluated the levels of mean-rate and spike-timing neural information for two categories of chimaeric speech, one retaining ENV cues and the other TFS; (b) examined the level of recovered ENV from cochlear filtering of TFS speech; (c) examined and quantified the contribution to recovered ENV from spike-timing cues using a lateral inhibition network (LIN); and (d) constructed linear regression models with objective measures of mean-rate and spike-timing neural cues and subjective phoneme perception scores from normal-hearing listeners. The mean-rate neural cues from the original ENV and recovered ENV partially accounted for perceptual score variability, with additional variability explained by the recovered ENV from the LIN-processed TFS speech. The best model predictions of chimaeric speech intelligibility were found when both the mean-rate and spike-timing neural cues were included, providing further evidence that spike-time coding of TFS cues is important for intelligibility when the speech envelope is degraded.
机译:语音清晰度的感知研究表明,一小组频带中的声围(ENV)的缓慢变化提供了足够的信息,以便在安静中提供良好的感知性能,而声学时间细结构(TFS)提示在背景噪声中发挥支持作用。然而,对于神经编码的影响易于误解,因为平均速率神经表示可以含有来自TF的耳蜗滤波的恢复的ENV提示。我们研究了使用嵌合言语速率的模拟平均速率和尖峰定时神经表示的客观措施来研究ENV恢复和峰值时间TFS编码,其中env或TFS被另一个信号代替。我们(a)评估了两类ChimaExic语音的平均速率和穗状态神经信息的水平,保留了一个保持envice和其他TFS; (b)检查了TFS语音的耳蜗滤波中恢复的ENV水平; (c)使用横向抑制网络(Lin)检查并量化对从峰值定时提示恢复ENV的贡献; (d)与普通听觉听众的平均速率和尖峰定时神经线索和主观音素感知分数构建了线性回归模型。来自原始ENV的平均速率神经线索和恢复的ENV部分地占了感知分数变异性,具有由LIN处理的TFS语音的恢复的ENV解释的额外变异性。当包括平均速率和尖峰时序的神经线索时,发现了嵌合语音清晰度的最佳模型预测,提供了进一步的证据表明,当语音包络降解时,TFS线索的尖峰时间编码对于可懂度很重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号