Improving the Accuracy of the Speech Synthesis Based Phonetic Alignment Using Multiple Acoustic Features

Abstract

The phonetic alignment of spoken utterances for speech research is commonly performed by HMM-based speech recognizers in forced-alignment mode, but training the phonetic segment models requires considerable amounts of annotated data. When no such material is available, a possible solution is to synthesize the same phonetic sequence and align the resulting speech signal with the spoken utterance. However, without a careful choice of the acoustic features used in this procedure, it can perform poorly when applied to continuous speech utterances. In this paper we propose a new method to select the best features to use in the alignment procedure for each pair of phonetic segment classes. The results show that this selection considerably reduces the segment boundary location errors.
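
The synthesis-based alignment idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say which alignment algorithm or features the authors use, so everything below is an assumption: dynamic time warping (DTW) with a Euclidean frame distance stands in for the alignment step, and the function names `dtw_path` and `map_boundaries`, the one-dimensional toy features, and the frame indices are hypothetical. The per-class-pair feature selection proposed in the paper is not reproduced here.

```python
import math


def dtw_path(synth, natural):
    """Return a minimum-cost DTW path as (synth_frame, natural_frame) pairs.

    Assumption: each frame is a tuple of feature values and the local
    distance is Euclidean. The paper's actual features are not given
    in the abstract.
    """
    n, m = len(synth), len(natural)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning the first i synthetic
    # frames with the first j natural frames.
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(synth[i - 1], natural[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j - 1], cost[i - 1][j], cost[i][j - 1]
            )
    # Backtrack from the end of both sequences to the start.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return path


def map_boundaries(path, synth_boundaries):
    """Map segment-start frames of the synthetic signal to natural frames.

    Each boundary is mapped to the first natural frame that the path
    pairs with the corresponding synthetic frame.
    """
    first_match = {}
    for i, j in path:
        first_match.setdefault(i, j)
    return [first_match[b] for b in synth_boundaries]


# Toy example: two "segments" with feature values 0.0 and 1.0. The
# synthetic signal has a known boundary at frame 2; the natural
# utterance is a slower rendition of the same sequence.
synth = [(0.0,), (0.0,), (1.0,), (1.0,)]
natural = [(0.0,), (0.0,), (0.0,), (1.0,), (1.0,), (1.0,)]
path = dtw_path(synth, natural)
natural_boundaries = map_boundaries(path, [2])
```

Because the segment boundaries of the synthetic signal are known from the synthesizer, projecting them through the warping path gives boundary estimates for the natural utterance without any annotated training data. In this sketch the local distance is the only place where the choice of acoustic features enters, which is where a selection per pair of segment classes would presumably act.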

Bibliographic details

Source
6th International Workshop on Computational Processing of the Portuguese Language (PROPOR 2003), June 26-27, 2003, Faro, Portugal | 2003 | pp. 31-39 | 9 pages
Conference location: Faro (PT)
Authors
Sérgio Paulo; Luís C. Oliveira
Author affiliation

L²F Spoken Language Systems Lab, INESC-ID/IST, Rua Alves Redol 9, 1000-029 Lisbon, Portugal

Original format: PDF
Language: English
Chinese Library Classification: Natural science research methods

