首页> 外国专利> Computer system for unsupervised speaker adaptation of DNN speech synthesis, method and program implemented in the computer system

Computer system for unsupervised speaker adaptation of DNN speech synthesis, method and program implemented in the computer system

机译:用于DNN语音合成的无监督说话者自适应的计算机系统,在该计算机系统中实现的方法和程序

摘要

The computer system 1 includes a speaker information estimation unit 130 that estimates speaker information of an unknown speaker based on acoustic features of the unknown speaker without requiring input of text as teacher data. The speaker information of the unknown speaker includes a speaker code representing the degree of similarity between the distribution of the acoustic feature of the unknown speaker and the distribution of the acoustic features of each of the plurality of known speakers as a probability. The computer system 1 uses the multi-speaker acoustic model (DNN) 230 to generate synthesized acoustic features of the unknown speaker based on the input language feature of the text and the speaker information of the unknown speaker. It further includes a synthetic acoustic feature quantity generation unit 220 for generating an amount, and a synthetic speech generation unit 240 for generating a synthesized speech of the unknown speaker based on the synthesized acoustic feature quantity of the unknown speaker.
机译:计算机系统1包括说话者信息估计单元130,该说话者信息估计单元130基于未知说话者的声学特征来估计未知说话者的说话者信息,而不需要输入文本作为教师数据。未知讲话者的讲话者信息包括代表概率的讲话者代码,该讲话者代码表示未知讲话者的声学特征的分布与多个已知讲话者中的每一个的声学特征的分布之间的相似度。计算机系统1使用多扬声器声学模型(DNN)230基于文本的输入语言特征和未知讲话者的讲话者信息来生成未知讲话者的合成声学特征。它还包括用于生成量的合成声学特征量生成单元220,以及用于基于未知说话者的合成声学特征量来生成未知说话者的合成语音的合成语音生成单元240。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号