首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing;ICASSP 2009 >Maximizing the continuity in segmentation - A new approach to model, segment and recognize speech
【24h】

Maximizing the continuity in segmentation - A new approach to model, segment and recognize speech

机译:最大化分段的连续性-一种建模,分段和识别语音的新方法

获取原文

摘要

This paper presents a new approach to speech modeling and recognition. The new approach consists of a statistical model to represent up to sentence-long temporal dynamics in the training data, and an algorithm to identify the matching segments with maximum continuities between the training and testing sentences. Recognition is performed by combining the longest matching segments found from the training sentences. Because of their richer and more distinct temporal dynamics, longer speech segments as whole units can be recognized with lower error rates than shorter speech segments. Therefore basing recognition on the longest matching segments optimizes the discrimination and hence recognition of speech. The new approach has been evaluated on the TIMIT database for identifying matching speech segments. The results obtained are encouraging given the very low parametric complexity of the new model.
机译:本文提出了一种新的语音建模和识别方法。新方法包括一个统计模型,该模型可以在训练数据中表示句子中最长的时间动态,以及一种算法,该算法可以识别出训练和测试句子之间具有最大连续性的匹配句段。通过组合从训练句子中找到的最长匹配段来执行识别。由于它们具有更丰富,更独特的时间动态特性,因此与较短的语音段相比,可以将较长的语音段作为整体单元以较低的错误率识别。因此,基于最长匹配段的识别可优化辨别力,从而优化语音识别。已经在TIMIT数据库上对新方法进行了评估,以识别匹配的语音片段。鉴于新模型的参数复杂度非常低,获得的结果令人鼓舞。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号