首页> 外国专利> Speech synthesis apparatus, speech synthesis method, speech synthesis program, speech synthesis model learning apparatus, speech synthesis model learning method, and speech synthesis model learning program

Speech synthesis apparatus, speech synthesis method, speech synthesis program, speech synthesis model learning apparatus, speech synthesis model learning method, and speech synthesis model learning program

机译:语音合成装置,语音合成方法,语音合成程序,语音合成模型学习装置,语音合成模型学习方法和语音合成模型学习程序

摘要

Prevents speech degradation and unnatural phoneme duration. The speech synthesis apparatus according to the embodiment includes a storage unit, a creation unit, a determination unit, a generation unit, and a waveform generation unit. The storage unit stores, as statistical model information, an output distribution of acoustic feature parameters including pitch feature parameters and a duration distribution based on time parameters in each state of a statistical model having a plurality of states. The creation unit creates a statistical model sequence from the context information corresponding to the input text and the statistical model information. The determination unit determines the number of pitch waveforms in each state using the duration information based on the duration distribution of each state of each statistical model in the statistical model sequence and the pitch information based on the output distribution of the pitch feature parameters. The generation unit generates an output distribution sequence of acoustic feature parameters based on the number of pitch waveforms, and generates an acoustic feature parameter based on the output distribution sequence. The waveform generation unit generates a speech waveform from the generated acoustic feature parameter.
机译:防止语音质量下降和音素持续时间不自然。根据实施例的语音合成装置包括存储单元,创建单元,确定单元,生成单元和波形生成单元。存储单元在具有多个状态的统计模型的每个状态中存储包括音高特征参数的声学特征参数的输出分布以及基于时间参数的持续时间分布作为统计模型信息。创建单元从与输入文本相对应的上下文信息和统计模型信息创建统计模型序列。确定单元使用基于统计模型序列中每个统计模型的每个状态的持续时间分布的持续时间信息以及基于音调特征参数的输出分布的音调信息来确定每个状态中的音调波形的数量。生成单元基于音高波形的数量来生成声学特征参数的输出分布序列,并且基于该输出分布序列来生成声学特征参数。波形产生单元从产生的声学特征参数产生语音波形。

著录项

  • 公开/公告号JPWO2017046887A1

    专利类型

  • 公开/公告日2018-04-12

    原文格式PDF

  • 申请/专利权人 株式会社東芝;

    申请/专利号JP20170540389

  • 发明设计人 田村 正統;森田 眞弘;

    申请日2015-09-16

  • 分类号G10L13/06;

  • 国家 JP

  • 入库时间 2022-08-21 13:06:42

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号