Unsupervised intralingual and cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction


Abstract

Hidden Markov model (HMM)-based speech synthesis systems possess several advantages over concatenative synthesis systems. One such advantage is the relative ease with which HMM-based systems are adapted to speakers not present in the training dataset. Speaker adaptation methods used in the field of HMM-based automatic speech recognition (ASR) are adopted for this task. In the case of unsupervised speaker adaptation, previous work has used a supplementary set of acoustic models to estimate the transcription of the adaptation data. This paper first presents an approach to the unsupervised speaker adaptation task for HMM-based speech synthesis models which avoids the need for such supplementary acoustic models. This is achieved by defining a mapping between HMM-based synthesis models and ASR-style models, via a two-pass decision tree construction process. Second, it is shown that this mapping also enables unsupervised adaptation of HMM-based speech synthesis models without the need to perform linguistic analysis of the estimated transcription of the adaptation data. Third, this paper demonstrates how this technique lends itself to the task of unsupervised cross-lingual adaptation of HMM-based speech synthesis models, and explains the advantages of such an approach. Finally, listener evaluations reveal that the proposed unsupervised adaptation methods deliver performance approaching that of supervised adaptation.
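The abstract does not spell out the two-pass construction, so the following is only a toy sketch under stated assumptions. Pass 1 grows a tree using triphone-style questions alone, giving ASR-style leaves. Pass 2 keeps splitting each of those leaves with the richer full-context questions used in synthesis. Every synthesis leaf then has exactly one ASR-style ancestor, which yields the mapping. The question names, the scalar data, and the variance-based split criterion are hypothetical stand-ins, not the authors' implementation.

```python
from statistics import pvariance

# Hypothetical question sets: ASR-style (triphone) vs. synthesis-style (full context).
TRIPHONE_QS = ["left_vowel", "right_vowel"]
FULL_QS = TRIPHONE_QS + ["stressed", "phrase_final"]

def sse(items):
    """Sum of squared deviations of the scalar values in a node."""
    vals = [v for _, v in items]
    return pvariance(vals) * len(vals) if len(vals) > 1 else 0.0

def best_split(items, questions, min_gain):
    """Pick the yes/no question that most reduces the squared error."""
    best = None
    for q in questions:
        yes = [it for it in items if it[0][q]]
        no = [it for it in items if not it[0][q]]
        if not yes or not no:
            continue  # question does not divide this node
        gain = sse(items) - sse(yes) - sse(no)
        if gain > min_gain and (best is None or gain > best[0]):
            best = (gain, yes, no)
    return best

def grow(items, questions, min_gain=1e-9):
    """Greedy top-down splitting; returns the leaves as lists of items."""
    split = best_split(items, questions, min_gain)
    if split is None:
        return [items]
    _, yes, no = split
    return grow(yes, questions, min_gain) + grow(no, questions, min_gain)

def two_pass(items):
    """Pass 1: triphone questions only. Pass 2: extend each leaf with full context."""
    asr_leaves = grow(items, TRIPHONE_QS)
    synth_leaves, mapping = [], {}  # mapping: synthesis leaf id -> ASR-style leaf id
    for asr_id, leaf in enumerate(asr_leaves):
        for sub in grow(leaf, FULL_QS):
            mapping[len(synth_leaves)] = asr_id
            synth_leaves.append(sub)
    return asr_leaves, synth_leaves, mapping

def ctx(left_vowel, stressed):
    """Build a made-up context; the other two features are fixed to False."""
    return {"left_vowel": left_vowel, "right_vowel": False,
            "stressed": stressed, "phrase_final": False}

# Made-up scalar "acoustic" values, one per context.
TOY = [(ctx(True, True), 10.0), (ctx(True, False), 8.0),
       (ctx(False, True), 2.0), (ctx(False, False), 0.0)]

asr_leaves, synth_leaves, mapping = two_pass(TOY)
print(mapping)
```

Because pass 2 only refines the leaves of pass 1, the mapping is many-to-one by construction. Under this reading, statistics gathered at the ASR-style leaves could be shared with every synthesis leaf beneath them.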


Bibliographic details

Authors: Gibson Matthew; Byrne William Joseph
Year: 2010
Original format: PDF
Language: English
Date indexed: 2022-08-20 20:25:34

Similar literature

1. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis [J]. John Dines, Hui Liang, Lakshmi Saheer. Computer Speech and Language, 2013, no. 2.
2. Speaker adaptation using context clustering decision tree for HMM-based speech synthesis [J]. Junichi YAMAGISHI, Takashi MASUKO, Keiichi TOKUDA. 電子情報通信学会技術研究報告. 音声. Speech, 2003, no. 264.
3. Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using two-pass decision tree construction [C]. Gibson, Matthew; Hirsimaki, Teemu; Karhila, Reima. IEEE International Conference on Acoustics Speech and Signal; ICASSP 2010. 2010.
4. Discriminative training for speaker adaptation and minimum Bayes risk estimation in large vocabulary speech recognition [D]. Doumpiotis, Vlasios. 2005.
5. INTERACTIONS BETWEEN UNSUPERVISED LEARNING AND THE DEGREE OF SPECTRAL MISMATCH ON SHORT-TERM PERCEPTUAL ADAPTATION TO SPECTRALLY-SHIFTED SPEECH [O]. Tianhao Li, John J. Galvin III, Qian-Jie Fu.
6. Unsupervised cross-lingual speaker adaptation for hmm-based speech synthesis using two-pass decision tree construction [O]. Gibson Matthew Thomas. 2010.
