Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-embedded Non-negative Matrix Factorization


Abstract

This paper proposes a discriminative learning method for Non-negative Matrix Factorization (NMF)-based Voice Conversion (VC). NMF-based VC has attracted research interest because it produces more natural-sounding speech than conventional Gaussian Mixture Model (GMM)-based VC. In conventional NMF-based VC, parallel exemplars are used directly as the dictionary, so no dictionary learning is performed. To enhance the conversion quality of NMF-based VC, we propose Discriminative Graph-embedded Non-negative Matrix Factorization (DGNMF). Parallel dictionaries of the source and target speakers are estimated discriminatively with DGNMF, based on the phoneme labels of the training data. Experimental results show that the proposed method not only improves conversion quality but also reduces computation time.
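The conventional conversion step that the abstract refers to can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: it assumes magnitude-spectrum features, a Euclidean-distance objective with the standard multiplicative update, and no sparsity term. The names `estimate_activations` and `convert` are hypothetical, and DGNMF itself is not implemented here because the abstract does not give its objective.

```python
import numpy as np


def estimate_activations(V, W, n_iter=200, eps=1e-12, seed=0):
    """Estimate non-negative activations H such that V ~= W @ H, with W fixed.

    Uses the multiplicative update for the Euclidean NMF objective
    (an assumption; the source does not state which divergence is used).
    """
    rng = np.random.default_rng(seed)
    # Strictly positive start so the multiplicative update can move every entry.
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
    return H


def convert(V_src, W_src, W_tgt, n_iter=200):
    """Convert source spectra using parallel dictionaries.

    Column k of W_src and column k of W_tgt are assumed to be paired
    (parallel) atoms, so activations estimated against the source
    dictionary can be reused with the target dictionary.
    """
    H = estimate_activations(V_src, W_src, n_iter=n_iter)
    return W_tgt @ H


if __name__ == "__main__":
    # Toy data only: 20 frequency bins, 5 dictionary atoms, 8 frames.
    rng = np.random.default_rng(1)
    W_src = rng.random((20, 5))
    W_tgt = rng.random((20, 5))
    V_src = W_src @ rng.random((5, 8))
    V_conv = convert(V_src, W_src, W_tgt)
    print(V_conv.shape)
```

In this sketch the dictionaries are simply given. In the conventional approach they would be the parallel exemplars themselves, whereas the paper's contribution is to learn them discriminatively from phoneme-labelled training data.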


Bibliographic details

Source
Annual Conference of the International Speech Communication Association | 2016 | p. 744 | 5 pages
Authors
Ryo Aihara; Tetsuya Takiguchi; Yasuo Ariki
Original format: PDF
Chinese Library Classification: TB95-53

Similar literature


1. Discriminative Graph-embedded Non-negative Matrix Factorizationを用いた声質変換のためのパラレル辞書学習 (Parallel dictionary learning for voice conversion using Discriminative Graph-embedded Non-negative Matrix Factorization) [J]. 相原龍 (Ryo Aihara), 滝口哲也 (Tetsuya Takiguchi), 有木康雄 (Yasuo Ariki). 電子情報通信学会技術研究報告. 音声 (IEICE Technical Report. Speech). 2016, No. 189

2. Toward semantic attributes in dictionary learning and non-negative matrix factorization [J]. Babaee Mohammadreza, Wolf Thomas, Rigoll Gerhard. Pattern Recognition Letters. 2016, issue sepa1

3. Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-embedded Non-negative Matrix Factorization [C]. Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki. Annual Conference of the International Speech Communication Association. 2016

4. Group Convex Orthogonal Non-negative Matrix Tri-Factorization with Applications in FC Fingerprinting [D]. Li, Kendrick. 2020

5. Label-Informed Non-negative Matrix Factorization with Manifold Regularization for Discriminative Subnetwork Detection [O]. Takanori Watanabe, Birkan Tunc, Drew Parker

6. Individuality-preserving Voice Conversion for Articulation Disorders Using Dictionary Selective Non-negative Matrix Factorization [O]. Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki. 2015
