Malaysian Journal of Computer Science

The Effect Of Changes In Speech Features On The Recognition Accuracy Of ASR System: A Study On The Malay Speech Impaired Children


Abstract

Speech impairment refers to a disability that causes human speech production to deviate from the norm. Although several studies have sought to identify the differences between non-impaired and impaired speech, little is known about how these differences affect speech intelligibility and the performance of ASR systems in recognizing the impaired speech of children. This study investigates the features of impaired speech in relation to intelligibility deficits and degradation in ASR performance. The features examined are formant frequencies, intensity, fundamental frequency (F0), and perturbation features such as jitter and shimmer. Because no speech database existed for this evaluation, we developed a database of speech from speech-impaired children and analysed its features. We identified significant differences in the selected features, and we also identified the relationship between the ASR system's Word Error Rate (WER) on impaired speech and the speech features. The results show significant differences in F0, jitter, and shimmer between the Control Group (CG) and the Speech Impaired Group (SIG). This paper explains differences between impaired and non-impaired speech that can be used in developing automated speech recognition systems. We observed that F0 affects ASR performance and is a significant predictor of the recognition accuracy of the vowel phonemes /e/ and /u/.
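For readers unfamiliar with the measures named in the abstract, the following is a minimal Python sketch of how WER and the local forms of jitter and shimmer are commonly defined. It is not the authors' implementation, and the abstract does not say which tools or variants they used. The function names `word_error_rate` and `local_perturbation` are hypothetical, and the sketch assumes that pitch periods and peak amplitudes have already been extracted from the recordings.

```python
from typing import Sequence


def word_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference prefix seen so far
    # and the first j words of the hypothesis.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(reference)


def local_perturbation(values: Sequence[float]) -> float:
    """Mean absolute difference between consecutive cycles, divided by the mean.

    Applied to pitch periods this gives local jitter; applied to per-cycle
    peak amplitudes it gives local shimmer (assumed definitions, see lead-in).
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values[:-1], values[1:])]
    return (sum(diffs) / len(diffs)) / (sum(values) / len(values))


if __name__ == "__main__":
    # Invented toy data, for illustration only (not from the study).
    print(word_error_rate(["buku", "merah"], ["buku", "biru"]))
    print(local_perturbation([0.0040, 0.0042, 0.0039, 0.0041]))
```

Both perturbation measures are ratios, so they are often reported as percentages. Relating them to WER, as the study does, would additionally require a statistical model that this sketch does not attempt.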
