Conference: International Symposium on Chinese Spoken Language Processing

A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis

Abstract

Natural pitch fluctuation is essential to the singing voice. We recently proposed a generalized F0 modelling method that models the expected F0 fluctuation under various contexts with note HMMs. Since F0 contours close to those of professional human singers improve perceived quality, we face two requirements: (1) accurate estimation of F0 and (2) precise voiced/unvoiced decisions. In this paper, we introduce two techniques in these directions. First, the influence of lyric phonetics on singing F0 is considered, to capture the F0 and voicing behaviour arising from different note-lyrics combinations. Second, the generalized F0 modelling method is extended to the frequency domain to study whether shape characterization in terms of sinusoids helps F0 estimation. Our experiments showed that the use of lyrics information leads to better F0 generation and improves the naturalness of synthesized singing. While the frequency-domain representation is viable, it is less competitive than the time-domain representation and requires further study.
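The abstract does not say how the frequency-domain shape characterization is computed. As a rough illustration only, the sketch below assumes a plain discrete Fourier transform over one note's F0 contour and keeps a few low-order sinusoids. The contour is synthetic, and every name in the code (`dft`, `truncate_spectrum`, `N_FRAMES`, and so on) is hypothetical rather than taken from the paper.

```python
import cmath
import math


def dft(x):
    """Plain O(N^2) discrete Fourier transform of a real sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def truncate_spectrum(spec, n_keep):
    """Keep the DC bin and the n_keep lowest-frequency sinusoids.

    Each kept bin k is retained together with its mirror bin N - k so the
    inverse transform stays real-valued. All other bins are zeroed.
    """
    n = len(spec)
    return [spec[k] if min(k, n - k) <= n_keep else 0j for k in range(n)]


def idft_real(spec):
    """Inverse DFT, returning only the real part of each sample."""
    n = len(spec)
    return [
        sum(spec[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def rmse(a, b):
    """Root-mean-square error between two equal-length sequences."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)) / len(a))


# Synthetic note (an assumption, not data from the paper): 50 frames of F0
# deviation in cents around the note pitch, made of a slow drift (1 cycle
# per note) plus a vibrato (3 cycles per note, 50-cent depth).
N_FRAMES = 50
contour = [
    10.0 * math.cos(2 * math.pi * 1 * t / N_FRAMES)
    + 50.0 * math.sin(2 * math.pi * 3 * t / N_FRAMES)
    for t in range(N_FRAMES)
]

spectrum = dft(contour)

# One sinusoid captures only the drift; three sinusoids also cover the vibrato.
approx_low = idft_real(truncate_spectrum(spectrum, 1))
approx_full = idft_real(truncate_spectrum(spectrum, 3))

err_low = rmse(contour, approx_low)
err_full = rmse(contour, approx_full)
print(f"RMSE with 1 sinusoid:  {err_low:.3f} cents")
print(f"RMSE with 3 sinusoids: {err_full:.3e} cents")
```

In this toy setting the number of retained sinusoids sets how much of the contour's shape survives. Real singing F0 is not periodic over a note, so a handful of sinusoids would only approximate it, which may relate to the weaker frequency-domain results the abstract reports.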