首页> 外文会议>Annual Conference of the International Speech Communication Association >Automatic Measurement of Voice Onset Time and Prevoicing using Recurrent Neural Networks
【24h】

Automatic Measurement of Voice Onset Time and Prevoicing using Recurrent Neural Networks

机译:使用反复性神经网络自动测量语音发作时间和前进性

获取原文

摘要

Voice onset time (VOT) is defined as the time difference between the onset of the burst and the onset of voicing. When voicing begins preceding the burst, the stop is called prevoiced, and the VOT is negative. When voicing begins following the burst the VOT is positive. While most of the work on automatic measurement of VOT has focused on positive VOT mostly evident in American English, in many languages the VOT can be negative. We propose an algorithm that estimates if the stop is prevoiced, and measures either positive or negative VOT, respectively. More specifically, the input to the algorithm is a speech segment of an arbitrary length containing a single stop consonant, and the output is the time of the burst onset, the duration of the burst, and the time of the prevoicing onset with a confidence. Manually labeled data is used to train a recurrent neural network that can model the dynamic temporal behavior of the input signal, and outputs the events' onset and duration. Results suggest that the proposed algorithm is superior to the current state-of-the-art both in terms of the VOT measurement and in terms of prevoicing detection.
机译:语音发起时间(VOT)被定义为突发的开始与发起的发起的时差。当发起声音在突发之前,所谓的停止被称为,并且票数是否定的。当发出声音后,突发突发时,票数是积极的。虽然大部分工作的VOT自动测量都集中在美国英语中大多是明显的,但在许多语言中,票子可能是消极的。我们提出了一种估计停止的算法,分别估计停止,并分别测量正面或负直票。更具体地,算法的输入是包含单个停止辅音的任意长度的语音段,并且输出是突发开始的时间,突发的持续时间,以及具有置信度的前进的发起的时间。手动标记的数据用于训练可以模拟输入信号的动态时间行为的经常性神经网络,并输出事件的开始和持续时间。结果表明,在速度测量和前进检测方面,所提出的算法优于目前的现有技术。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号