A Voice Conversion Method Based on PSO-Optimized GRNN

Journal: Computer Engineering & Science (《计算机工程与科学》)


Abstract

This paper proposes a voice conversion method that uses Particle Swarm Optimization (PSO) to optimize a Generalized Regression Neural Network (GRNN). First, the characteristic parameters of the training speech's vocal tract and excitation source are used to train two GRNNs, which yields the structural parameters of the GRNNs. Second, PSO is used to optimize these structural parameters, reducing the adverse impact of human factors on the conversion results. Finally, the prosodic features, namely the pitch contour and the energy profile, are linearly converted so that the converted voice carries more of the personalized feature information of the source speaker. Subjective and objective experiments show that, compared with voice conversion methods based on the radial basis function (RBF) neural network and the plain GRNN, the proposed method markedly improves the naturalness and likelihood of the converted voices and clearly lowers the spectral distortion rate, so the converted voices are closer to the target voices.
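The idea of tuning a GRNN with PSO can be illustrated with a minimal sketch. The abstract does not say which structural parameters are optimized, so this example assumes it is the smoothing factor (spread) sigma, which is the parameter usually picked by hand in a GRNN. The function names (`grnn_predict`, `validation_mse`, `pso_tune_sigma`), the PSO coefficients, and the toy sine-mapping data are all hypothetical and stand in for real vocal-tract or excitation features; this is not the authors' implementation.

```python
import math
import random


def grnn_predict(x, train_x, train_y, sigma):
    """GRNN output: Gaussian-kernel-weighted average of the training targets."""
    dists = [sum((a - b) ** 2 for a, b in zip(x, xi)) for xi in train_x]
    weights = [math.exp(-d2 / (2.0 * sigma ** 2)) for d2 in dists]
    total = sum(weights)
    if total == 0.0:
        # Every kernel underflowed: fall back to the nearest training target.
        nearest = min(range(len(dists)), key=lambda i: dists[i])
        return list(train_y[nearest])
    dim = len(train_y[0])
    return [sum(w * y[k] for w, y in zip(weights, train_y)) / total
            for k in range(dim)]


def validation_mse(sigma, train_x, train_y, val_x, val_y):
    """Mean squared error of the GRNN on held-out frames (the PSO fitness)."""
    err, count = 0.0, 0
    for x, y in zip(val_x, val_y):
        pred = grnn_predict(x, train_x, train_y, sigma)
        err += sum((p - t) ** 2 for p, t in zip(pred, y))
        count += len(y)
    return err / count


def pso_tune_sigma(fitness, lo=0.01, hi=2.0, particles=12, iters=30,
                   w=0.7, c1=1.5, c2=1.5, seed=0):
    """One-dimensional PSO over sigma; returns (best_sigma, best_fitness)."""
    rng = random.Random(seed)
    pos = [rng.uniform(lo, hi) for _ in range(particles)]
    vel = [0.0] * particles
    best_pos = pos[:]                       # personal best positions
    best_fit = [fitness(p) for p in pos]    # personal best fitness values
    g = min(range(particles), key=lambda i: best_fit[i])
    g_pos, g_fit = best_pos[g], best_fit[g]  # global best
    for _ in range(iters):
        for i in range(particles):
            r1, r2 = rng.random(), rng.random()
            vel[i] = (w * vel[i]
                      + c1 * r1 * (best_pos[i] - pos[i])
                      + c2 * r2 * (g_pos - pos[i]))
            pos[i] = min(hi, max(lo, pos[i] + vel[i]))  # clamp to the bounds
            f = fitness(pos[i])
            if f < best_fit[i]:
                best_pos[i], best_fit[i] = pos[i], f
                if f < g_fit:
                    g_pos, g_fit = pos[i], f
    return g_pos, g_fit


# Toy stand-in for source-to-target feature pairs: map x to sin(x).
train_x = [[i * 0.25] for i in range(25)]
train_y = [[math.sin(x[0])] for x in train_x]
val_x = [[i * 0.25 + 0.125] for i in range(24)]
val_y = [[math.sin(x[0])] for x in val_x]


def fitness(sigma):
    return validation_mse(sigma, train_x, train_y, val_x, val_y)


best_sigma, best_mse = pso_tune_sigma(fitness)
print("tuned sigma:", best_sigma, "validation MSE:", best_mse)
```

In this sketch the search replaces a hand-picked spread with one chosen by held-out error, which is one plausible reading of how PSO reduces the influence of human factors. A real system would train one such network on vocal-tract features and another on excitation features, as the abstract describes.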
