首页> 外文期刊>Computer speech and language >Adjusting dysarthric speech signals to be more intelligible
【24h】

Adjusting dysarthric speech signals to be more intelligible

机译:调整发音异常的语音信号使其更清晰

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a system that transforms the speech signals of speakers with physical speech disabilities into a more intelligible form that can be more easily understood by listeners. These transformations are based on the correction of pronunciation errors by the removal of repeated sounds, the insertion of deleted sounds, the devoicing of unvoiced phonemes, the adjustment of the tempo of speech by phase vocoding, and the adjustment of the frequency characteristics of speech by anchor-based morphing of the spectrum. These transformations are based on observations of disabled articulation including improper glottal voicing, lessened tongue movement, and lessened energy produced by the lungs. This system is a substantial step towards full automation in speech transformation without the need for expert or clinical intervention. Among human listeners, recognition rates increased up to 191% (from 21.6% to 41.2%) relative to the original speech by using the module that corrects pronunciation errors. Several types of modified dysarthric speech signals are also supplied to a standard automatic speech recognition system. In that study, the proportion of words correctly recognized increased up to 121 % (from 72.7% to 87.9%) relative to the original speech, across various parameterizations of the recognizer. This represents a significant advance towards human-to-human assistive communication software and human-computer interaction.
机译:本文提出了一种系统,该系统可将具有肢体语言障碍的说话者的语音信号转换为更易于理解的形式,使听众更容易理解。这些转换的基础是通过消除重复的声音,插入已删除的声音,清除清音音素,通过相位声码调整语音速度以及通过调整语音频率特性来纠正发音错误。基于锚的频谱变形。这些转换是基于对关节活动不清的观察,包括不正确的声门发声,舌头运动减少和肺部产生的能量减少。该系统是实现语音转换完全自动化的实质性步骤,无需专家或临床干预。在人类听众中,通过使用纠正发音错误的模块,相对于原始语音,识别率提高了191%(从21.6%增至41.2%)。几种类型的修改后的构音障碍语音信号也被提供给标准的自动语音识别系统。在该研究中,在识别器的各种参数设置中,相对于原始语音,正确识别的单词比例增加了121%(从72.7%增至87.9%)。这代表了在人对人的辅助通信软件和人机交互方面的重大进步。

著录项

  • 来源
    《Computer speech and language》 |2013年第6期|1163-1177|共15页
  • 作者

    Frank Rudzicz;

  • 作者单位

    Department of Computer Science, University of Toronto, 10 King's College Road, Room 3302 Toronto, Ontario M5S 3G4, Canada;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Speech transformation; Dysarthria; Intelligibility;

    机译:语音转换;构音障碍;可理解性;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号