首页> 外国专利> COMPUTER SYSTEM CREATING SPEAKER ADAPTATION WITHOUT TEACHER IN DNN-BASED SPEECH SYNTHESIS, AND METHOD AND PROGRAM EXECUTED IN COMPUTER SYSTEM

COMPUTER SYSTEM CREATING SPEAKER ADAPTATION WITHOUT TEACHER IN DNN-BASED SPEECH SYNTHESIS, AND METHOD AND PROGRAM EXECUTED IN COMPUTER SYSTEM

机译：基于DNN的语音合成中不带教师的计算机系统创建扬声器自适应的方法，计算机系统中执行的方法和程序

页面导航

摘要
著录项
相似文献

摘要

A computer system 1 includes a speaker information estimation unit 130 that estimates the speaker information of an unknown speaker on the basis of the acoustic feature amount for the unknown speaker without the need to enter text as teacher data. The speaker information of unknown speaker includes a speaker code that represents similarity by probability between a distribution of the acoustic feature amount for the unknown speaker and a distribution for each of the acoustic feature amounts for a plurality of known speakers. The computer system 1 further comprises: a synthesized acoustic feature amount generation unit 220 for generating a synthesized acoustic feature amount for the unknown speaker on the basis of a language feature amount for an input text and the speaker information of the unknown speaker, using acoustic models (DNN) 230 of multiple speakers; and a synthesized speech generation unit 240 for generating a synthesized speech of the unknown speaker on the basis of the synthesized acoustic feature amount of the unknown speaker.

机译：计算机系统1包括说话者信息估计单元130，该说话者信息估计单元130基于未知说话者的声学特征量来估计未知说话者的说话者信息，而无需输入文本作为教师数据。未知说话者的说话者信息包括说话者代码，该说话者代码通过概率来表示未知说话者的声学特征量的分布与多个已知说话者的每个声学特征量的分布之间的相似度。计算机系统1还包括：合成声学特征量生成单元220，用于使用声学模型基于输入文本的语言特征量和未知扬声器的扬声器信息来生成未知扬声器的合成声学特征量。（DNN）230个多个扬声器;合成语音生成单元240，用于基于未知说话者的合成声学特征量生成未知说话者的合成语音。

著录项

公开/公告号WO2019044401A1

专利类型
公开/公告日2019-03-07

原文格式PDF
申请/专利权人 INTER-UNIVERSITY RESEACH INSTITUTE CORPORATION RESEARCH ORGANIZATION OF INFORMATION AND SYSTEMS;
展开▼

申请/专利号WO2018JP29438
发明设计人 YAMAGISHI JUNICHI;TAKAKI SHINJI;
展开▼

申请日2018-08-06
分类号G10L13/10;
国家 WO
入库时间 2022-08-21 11:56:11

相似文献

专利
外文文献
中文文献