首页> 外文期刊>Computer speech and language >Approaching speech intelligibility enhancement with inspiration from Lombard and Clear speaking styles
【24h】

Approaching speech intelligibility enhancement with inspiration from Lombard and Clear speaking styles

机译:从伦巴底和清晰的说话风格中获得灵感,提高语音清晰度

获取原文
获取原文并翻译 | 示例
           

摘要

Lombard and Clear speech represent two acoustically and perceptually distinct speaking styles that humans employ to increase intelligibility. For Lombard speech, increased spectral energy in a band spanning the range of formants is consistent, effectively augmenting loudness, while vowel space expansion is exhibited in Clear speech, indicating greater articulation. On the other hand, analyses in the first part of this work illustrate that Clear speech does not exhibit significant spectral energy boosting, nor does the Lombard effect invoke an expansion of vowel space. Accordingly, though these two acoustic phenomena are largely attributed with the respective intelligibility gains of the styles, present analyses would suggest that they are mutually exclusive in human speech production. However, these phenomena can be used to inspire signal processing algorithms that seek to exploit and ultimately compound their respective intelligibility gains, as is explored in the second part of this work. While Lombard-inspired spectral shaping has been shown to successfully increase intelligibility, Clear speech-inspired modifications to expand vowel space are rarely explored. With this in mind, the latter part of this work focuses mainly on a novel frequency warping technique that is shown to achieve vowel space expansion. The frequency warping is then incorporated into an established Lombard-inspired Spectral Shaping method that pairs with dynamic range compression to maximize speech audibility (SSDRC). Finally, objective and subjective evaluations are presented in order to assess and compare the intelligibility gains of the different styles and their inspired modifications.
机译:朗伯和清晰的语音代表了人类用来提高清晰度的两种听觉和听觉上截然不同的说话风格。对于Lombard语音,在共振峰范围内的频带中增加的频谱能量是一致的,有效地提高了响度,而Clear语音中则显示了元音空间扩展,表明了清晰的发音。另一方面,这项工作的第一部分中的分析表明,清晰的语音没有表现出明显的频谱能量提升,伦巴第效应也没有引起元音空间的扩展。因此,尽管这两种声学现象在很大程度上归因于样式的各自的可理解性增益,但当前的分析表明它们在人类语音产生中是互斥的。但是,这些现象可以用来激发信号处理算法,这些算法试图利用并最终使它们各自的可懂度提高,正如本工作的第二部分所探讨的那样。伦巴第启发式的频谱整形已被证明可以成功地提高清晰度,但很少有人探索清晰的语音启发式修饰以扩大元音空间。考虑到这一点,本文的后半部分主要集中在一种新颖的频率扭曲技术上,该技术被证明可以实现元音空间的扩展。然后将频率扭曲合并到已建立的Lombard启发式频谱整形方法中,该方法与动态范围压缩配对以最大程度地提高语音可听性(SSDRC)。最后,提出了客观和主观的评估,以评估和比较不同风格及其启发性修改的清晰度。

著录项

  • 来源
    《Computer speech and language》 |2014年第2期|629-647|共19页
  • 作者单位

    Institute of Computer Science, Foundation for Research and Technology Hellas, Crete, Greece;

    Institute of Computer Science, Foundation for Research and Technology Hellas, Crete, Greece,Multimedia Informatics Lab, Computer Science Department, University of Crete, Greece;

    Institute of Computer Science, Foundation for Research and Technology Hellas, Crete, Greece,Multimedia Informatics Lab, Computer Science Department, University of Crete, Greece;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Lombard effect; Clear speech; Intelligibility enhancement;

    机译:伦巴第效应;语言清晰;增强清晰度;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号