Basis-Based Speaker Adaptation Using Partitioned HMM Mean Parameters of Training Speaker Models

Jeong Yongwon


Abstract

This paper presents a basis-based speaker adaptation method that includes approaches using principal component analysis (PCA) and two-dimensional PCA (2DPCA). The proposed method partitions the hidden Markov model (HMM) mean vectors of the training speaker models into subvectors of smaller dimension. Consequently, the dimension of the sample covariance matrix computed from the partitioned HMM mean vectors depends on the dimension of the subvectors. Basis vectors are constructed from the eigendecomposition of the sample covariance matrix. The dimension of the basis vectors therefore varies with the dimension of the sample covariance matrix, and the proposed method includes both the PCA-based and 2DPCA-based approaches. We present the adaptation equation in both the maximum likelihood (ML) and maximum a posteriori (MAP) frameworks. We perform continuous speech recognition experiments using the Wall Street Journal (WSJ) corpus. The results show that models with basis vectors whose dimensions lie between those of the PCA-based and 2DPCA-based approaches show good overall performance. The proposed approach in the MAP framework shows additional performance improvement over its ML counterpart when the number of adaptation parameters is large but the amount of available adaptation data is small. Furthermore, the performance of the approach in the MAP framework is less sensitive to the choice of model order than that of its ML counterpart.
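
The basis construction described in the abstract can be illustrated with a minimal NumPy sketch. It is not the paper's implementation: the abstract does not say how the mean vectors are partitioned, so the sketch assumes each training speaker is represented by one concatenated mean supervector that is split into contiguous, equal-sized subvectors. The function name `partitioned_basis` and its parameters are hypothetical. The ML and MAP adaptation equations are not given in the abstract and are not reproduced here.

```python
import numpy as np


def partitioned_basis(supervectors, sub_dim, num_basis):
    """Build basis vectors from partitioned speaker mean supervectors.

    supervectors: (S, D) array, one HMM mean supervector per training speaker
                  (assumed layout, not taken from the paper).
    sub_dim:      dimension d of each subvector; must divide D.
    num_basis:    number of leading eigenvectors to keep (at most sub_dim).
    Returns (mean, basis) with shapes (D,) and (sub_dim, num_basis).
    """
    num_speakers, full_dim = supervectors.shape
    if full_dim % sub_dim != 0:
        raise ValueError("sub_dim must divide the supervector dimension")
    if num_basis > sub_dim:
        raise ValueError("num_basis cannot exceed sub_dim")

    # Center the data with the mean over training speakers.
    mean = supervectors.mean(axis=0)
    centered = supervectors - mean

    # Split every supervector into contiguous subvectors of length sub_dim.
    subvectors = centered.reshape(-1, sub_dim)

    # Sample covariance of the subvectors: its size is sub_dim x sub_dim,
    # so it changes with the chosen partition.
    cov = subvectors.T @ subvectors / num_speakers

    # Eigendecomposition of the symmetric covariance matrix;
    # np.linalg.eigh returns eigenvalues in ascending order.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:num_basis]
    return mean, eigvecs[:, order]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    toy = rng.normal(size=(5, 12))  # 5 toy speakers, 12-dim supervectors
    for d in (12, 4, 3):
        _, basis = partitioned_basis(toy, d, 2)
        print(d, basis.shape)
```

Under these assumptions, setting `sub_dim` to the full supervector dimension yields an ordinary PCA covariance over speakers, while smaller values give smaller covariance matrices and shorter basis vectors, which mirrors the range between the PCA-based and 2DPCA-based approaches that the abstract describes.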


Bibliographic details

Source
Journal of Signal Processing Systems for Signal, Image, and Video Technology | 2016, Issue 3 | pp. 303-310 | 8 pages
Author
Jeong Yongwon
Affiliation

Pusan Natl Univ, Dept Elect Engn, Busan 609735, South Korea

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
Eigenvoice adaptation; Speaker adaptation; Speech recognition; Two-dimensional PCA

