首页> 外文会议>Annual Conference of the International Speech Communication Association >Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-embedded Non-negative Matrix Factorization
【24h】

Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-embedded Non-negative Matrix Factorization

机译:使用鉴别图嵌入非负矩阵分解的语音转换的并行词典学习

获取原文

摘要

This paper proposes a discriminative learning method for Non-negative Matrix Factorization (NMF)-based Voice Conversion (VC). NMF-based VC has been researched because of the natural-sounding voice it produces compared with conventional Gaussian Mixture Model (GMM)-based VC. In conventional NMF-based VC, parallel exemplars are used as the dictionary; therefore, dictionary learning is not adopted. In order to enhance the conversion quality of NMF-based VC, we propose Discriminative Graph-embedded Non-negative Matrix Factorization (DGNMF). Parallel dictionaries of the source and target speakers are discriminatively estimated by using DGNMF based on the phoneme labels of the training data. Experimental results show that our proposed method can not only improve the conversion quality but also reduce the computational times.
机译:本文提出了基于非负矩阵分解(NMF)的语音转换(VC)的判别学习方法。基于NMF的VC已经研究,因为与传统的高斯混合模型(GMM)相比,它产生的自然发声声为基础的VC。在传统的基于NMF的VC中,并行示例用作字典;因此,没有采用字典学习。为了提高基于NMF的VC的转换质量,我们提出了鉴别的图形嵌入非负矩阵分解(DGNMF)。通过使用基于训练数据的音素标签使用DGNMF来判别源和目标扬声器的并行词典。实验结果表明,我们所提出的方法不仅可以提高转换质量,还可以减少计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号