Speaker adaptation using context clustering decision tree for HMM-based speech synthesis

Junichi YAMAGISHI; Takashi MASUKO; Keiichi TOKUDA; Takao KOBAYASHI

首页> 外文期刊>電子情報通信学会技術研究報告. 音声. Speech >Speaker adaptation using context clustering decision tree for HMM-based speech synthesis

【24h】

Speaker adaptation using context clustering decision tree for HMM-based speech synthesis

机译：基于上下文聚类决策树的说话人自适应，用于基于HMM的语音合成

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In order to synthesize speech with arbitrary individualities and/or emotional expressions, segment-based features have to be used as well as frame-based features. In this paper, to realize MLLR (Maximum Likelihood Liner Regression) based speaker adaptation reflecting those segment-based features for HMM-based speech synthesis, we propose a technique for applying context clustering decision trees constructed in a training stage to tying of regression matrices. Since a set of questions used for constructing context clustering decision trees contains questions related to segment-based features such as position and length, it is possible to incorporate segment-based features into the adaptation. We show that synthesized speech from the adapted model using the proposed technique can have segment-based features.

机译：为了合成具有任意个性和/或情感表达的语音，必须使用基于片段的特征以及基于帧的特征。在本文中，为了实现基于MLLR（最大似然线性回归）的说话人自适应，以反映那些基于片段的特征，用于基于HMM的语音合成，我们提出了一种技术，该技术将在训练阶段构造的上下文聚类决策树应用于回归矩阵的绑定。由于用于构建上下文聚类决策树的一组问题包含与基于片段的特征（例如位置和长度）相关的问题，因此可以将基于片段的特征合并到适应中。我们表明，使用提出的技术从适应模型合成语音可以具有基于片段的特征。

著录项

来源
《電子情報通信学会技術研究報告. 音声. Speech》 |2003年第264期|共6页
作者
Junichi YAMAGISHI; Takashi MASUKO; Keiichi TOKUDA; Takao KOBAYASHI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 jpn
中图分类电报、传真;
关键词
HMM-based speech synthesis; Speaker adaptation; Maximum likelihood liner regression; Decision tree; Voice characteristics and prosodic features; Segment-based features;

机译：基于HMM的语音合成;说话人自适应;最大似然线性回归;决策树;语音特征和韵律特征;基于分段的特征;
入库时间 2022-08-19 10:22:04

相似文献

外文文献
中文文献
专利

1. Speaker adaptation using context clustering decision tree for HMM-based speech synthesis [J] . Junichi YAMAGISHI, Takashi MASUKO, Keiichi TOKUDA, 電子情報通信学会技術研究報告. 音声. Speech . 2003,第264期

机译：基于上下文聚类决策树的说话人自适应，用于基于HMM的语音合成
2. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis [J] . John Dines, Hui Liang, Lakshmi Saheer, Computer speech and language . 2013,第2期

机译：个性化语音到语音翻译：基于HMM的语音合成的无监督跨语言说话者自适应
3. Analysis of Speaker Adaptation Algorithms for HMM-Based Speech Synthesis and a Constrained SMAPLR Adaptation Algorithm [J] . Yamagishi J., Kobayashi T., Nakano Y., IEEE transactions on audio, speech and language processing . 2009,第1期

机译：基于HMM的语音合成的说话人自适应算法和约束SMAPLR自适应算法的分析
4. SPEAKING STYLE ADAPTATION USING CONTEXT CLUSTERING DECISION TREE FOR HMM-BASED SPEECH SYNTHESIS [C] . Junichi Yamagishi, Makoto Tachibana, Takashi Masuko, IEEE International Conference on Acoustics, Speech, and Signal Processing . 2004

机译：使用上下文聚类决策树对基于HMM的语音合成的语言适应性
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D] . Talwar, Gaurav 2006

机译：基于HMM的非侵入式语音质量以及基于Viterbi分数分布和隐蔽性的措施的实施，以提高语音识别的性能
6. How the Human Brain Recognizes Speech in the Context of Changing Speakers [O] . Katharina von Kriegstein, David R. R. Smith, Roy D. Patterson, 2010

机译：在说话者不断变化的背景下人脑如何识别语音
7. Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction [O] . Gibson Matthew, Byrne William Joseph 2010

机译：使用两遍决策树构造的基于HMM的语音合成的无监督语内和跨语说话者适应

Speaker adaptation using context clustering decision tree for HMM-based speech synthesis

摘要

著录项

相似文献

相关主题

期刊订阅