首页> 外文会议>3rd international universal communications symposium 2009 >Unit Selection Using k-Nearest Neighbor Search for Concatenative Speech Synthesis
【24h】

Unit Selection Using k-Nearest Neighbor Search for Concatenative Speech Synthesis

机译:使用k最近邻搜索进行级联语音合成的单位选择

获取原文
获取原文并翻译 | 示例

摘要

We propose a new approach to rapidly identifying adequate synthesis units in extremely large speech corpora. Our aim is to develop a concatenative speech synthesis system with high performance (both speech quality and throughput) for various practical applications. Utilizing very large speech corpora allows more natural sounding synthesized speech to be created; the downside is an increase in the time taken to locate the synthesis units needed. The key to overcoming this problem is introducing state-of-the art database retrieval technologies. The first selection step, based on simple hash search, tabulates all synthesis unit candidates. The second step selects N best candidates using nearest neighbor search, a typical database retrieval technique. Finally, the best sequence of synthesis units is determined by Viterbi search. A runtime measurement test and subjective experiment are carried out. Their results confirm that the proposed approach reduces the runtime by about 40% compared to using only hash search with no degradation in the quality of synthesized speech for a 15 hour corpus.
机译:我们提出了一种新方法,可以快速识别超大型语音语料库中的适当合成单元。我们的目标是为各种实际应用开发具有高性能(语音质量和吞吐量)的级联语音合成系统。使用非常大的语音语料库可以创建听起来更自然的合成语音。缺点是增加了定位所需合成单元所需的时间。解决此问题的关键是引入最新的数据库检索技术。第一步选择基于简单的哈希搜索,将所有合成单元候选列表化。第二步使用最近的邻居搜索(一种典型的数据库检索技术)选择N个最佳候选者。最后,通过维特比搜索确定最佳的合成单位序列。进行了运行时测量测试和主观实验。他们的结果证实,与仅使用散列搜索相比,在15小时语料库中合成语音的质量没有下降的情况下,所提出的方法将运行时间减少了约40%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号