Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network

Yi ZHAO; Nobuaki MINEMATSU; Daisuke SAITO

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network

【24h】

Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network

机译：基于深度双向长短期记忆递归神经网络的多说话人语音合成与说话人自适应

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, a deep bidirectional long short-term memory recurrent neural network (DBLSTM-RNN) based multi-speaker synthesis model is proposed to improve the synthesis quality for a target speaker whose corpus is limited. This model consists of speaker independent network (SIN) and speaker dependent network (SDN), where SIN is jointly trained by multiple speakers and SDN is designed for designed for each of the target speakers. In particular, gender code as well as speaker code or i-vector are prepared as augmented input information to help SIN realize better distinction among different target speakers. Experimental results show that our proposed model improves the synthesis performance with a fairly small database for each speaker, compared with DNN-based multi--speaker TTS and conventional DBLSTM-RNN based TTS. In addition, this multi-speaker model can also be used to perform speaker adaptation, and is experimentally shown to be capable of achieving good quality speech of a new speaker in terms of naturalness and speaker identity.

机译：为了提高语料库有限的目标说话者的合成质量，本文提出了一种基于深度双向双向长短期记忆递归神经网络（DBLSTM-RNN）的多说话者综合模型。此模型由独立于扬声器的网络（SIN）和依赖扬声器的网络（SDN）组成，其中SIN由多个扬声器共同训练，并且SDN是为每个目标扬声器而设计的。特别地，准备性别代码以及说话者代码或i-vector作为增强的输入信息，以帮助SIN更好地区分不同目标说话者。实验结果表明，与基于DNN的多扬声器TTS和基于DBLSTM-RNN的传统TTS相比，我们提出的模型可以通过每个发言人较小的数据库来提高综合性能。另外，该多说话者模型也可以用于执行说话者自适应，并且在自然上和说话者身份方面通过实验证明能够实现新说话者的高质量语音。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2015年第346期|共6页
作者
Yi ZHAO; Nobuaki MINEMATSU; Daisuke SAITO;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电报、传真;
关键词
Multi-speaker speech synthesis; Speaker adaptation; DBLSTM-RNN; Speaker code; I-vector;

机译：多说话人语音合成;说话人自适应;DBLSTM-RNN;说话人代码;I矢量;

相似文献

外文文献
中文文献
专利

1. Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network [J] . Yi ZHAO, Nobuaki MINEMATSU, Daisuke SAITO 電子情報通信学会技術研究報告. 音声. Speech . 2015,第346期

机译：基于深度双向长短期记忆递归神经网络的多说话人语音合成与说话人自适应
2. Hierarchical Bayesian combination of plug-in maximum a posteriori decoders in deep neural networks-based speech recognition and speaker adaptation [J] . Huang Zhen, Siniscalchi Sabato Marco, Lee Chin-Hui Pattern recognition letters . 2017,第octa15期

机译：基于深度神经网络的语音识别和说话人自适应的插件最大后验解码器的分层贝叶斯组合
3. Discriminative Learning of Filterbank Layer within Deep Neural Network Based Speech Recognition for Speaker Adaptation [J] . Hiroshi SEKI, Kazumasa YAMAMOTO, Tomoyosi AKIBA, IEICE transactions on information and systems . 2019,第2期

机译：基于深度神经网络的说话人自适应语音识别的判别学习
4. Speech enhancement using Long Short-Term Memory based recurrent Neural Networks for noise robust Speaker Verification [C] . Morten Kolbœk, Zheng-Hua Tan, Jesper Jensen IEEE Workshop on Spoken Language Technology . 2016

机译：使用基于长期短期记忆的递归神经网络进行语音增强以增强对噪声的说话人验证
5. Non-linguistic Vocalization Recognition Based on Convolutional, Long Short-term Memory, Deep Neural Networks [D] . Qiu, Liang. 2018

机译：基于卷积，长短时记忆，深度神经网络的非语言语音识别
6. Online Prediction of Ship Behavior with Automatic Identification System Sensor Data Using Bidirectional Long Short-Term Memory Recurrent Neural Network [O] . Miao Gao, Guoyou Shi, Shuang Li 2018

机译：双向长短期记忆递归神经网络通过自动识别系统传感器数据对船舶行为进行在线预测
7. Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition [O] . Li, Xiangang, Wu, Xihong 2015

机译：基于深度递归神经网络构建长短期记忆用于大词汇量语音识别

Multi-speaker speech synthesis and speaker adaptation based on deep bidirectional long short-term memory recurrent neural network

摘要

著录项

相似文献

相关主题

期刊订阅