A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis

机译：歌词与歌词综合表征F0建模与发电研究

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Natural pitch fluctuation is essential to singing voice. Recently, we have proposed a generalized F0 modelling method which models the expected F0 fluctuation under various contexts with note HMMs. Knowing that having F0 contours close to human professional singing promotes perceived quality, we are confronted with two requirements: (1) accurate estimation on F0 and (2) precise voiced/unvoiced decisions. In this paper, we introduce two techniques in the above directions. Influence of lyrics phonetics on singing F0 is considered to capture the F0 and voicing behaviour brought from different note-lyrics combinations. The generalized F0 modelling method is further extended to frequency-domain to study if shape characterization in terms of sinusoids helps F0 estimation or not. Our experiments showed that the use of lyrics information leads to better F0 generation and improves naturalness of synthesized singing. While the frequency-domain representation is viable, its performance is less competitive than time-domain representation, which requires further study.

机译：自然间距波动对歌唱声音至关重要。最近，我们提出了一种广泛的F0建模方法，其在各种情况下模拟了预期的F0波动，在具有注意HMMS的各种情况下。知道具有靠近人类专业歌唱的F0轮廓促进感知质量，我们面临了两个要求：（1）对F0和（2）精确的浊音/清音决策准确估算。在本文中，我们在上述方向上介绍了两种技术。歌词语音对歌唱F0的影响被认为是捕获不同票据歌词组合带来的F0和发声行为。广义F0建模方法进一步扩展到频域以研究，如果形状表征在正弦曲线上有助于F0估计。我们的实验表明，歌词信息的使用导致更好的F0生成并提高合成歌唱的自然度。虽然频域表示是可行的，但其性能与时域表示的性能较低，这需要进一步研究。

著录项

来源
《International Symposium on Chinese Spoken Language Processing》|2012年||共5页
会议地点
作者
Lee S. W.; Dong Minghui; Li Haizhou;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP18-53;
关键词
lyrics; modelling; pitch; singing; synthesis;

机译：歌词;建模;沥青;唱歌;合成;

相似文献

外文文献
中文文献
专利

1. Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis [J] . Takeshi Saitou, Masashi Unoki, Masato Akagi Speech Communication . 2005,第3a4期

机译：基于F0动态特性的F0控制模型的歌声合成
2. HMM-based expressive singing voice synthesis with singing style control and robust pitch modeling [J] . Takashi Nose, Misa Kanemoto, Tomoki Koriyama, Computer speech and language . 2015,第1期

机译：基于HMM的表达性歌声合成，具有歌唱风格控制和可靠的音高建模
3. Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information [J] . Motoyuki Suzuki, Toru Hosoya, Akinori Ito, EURASIP journal on advances in signal processing . 2006,第1期

机译：使用歌词和旋律信息从歌声中检索音乐信息
4. A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis [C] . Lee S. W., Dong Minghui, Li Haizhou 2012 8th International Symposium on Chinese Spoken Language Processing. . 2012

机译：用歌词和形状表征进行歌声合成的F0建模和生成研究
5. The role of auditory feedback on the control of voice fundamental frequency (F0) while singing. [D] . Keough, Dwayne Nicholas. 2010

机译：听觉反馈在唱歌时控制语音基频（F0）的作用。
6. Involvement of the larynx motor area in singing-voice perception: a TMS study† [O] . Yohana Lévêque, Neil Muggleton, Lauren Stewart, 2013

机译：喉部运动区域参与歌唱声感知：TMS研究†
7. Expressive Control of Singing Voice Synthesis Using Musical Contexts and a Parametric F0 Model [O] . Ardaillon, Luc, Chabot-Canet, Céline, Roebel, Axel 2016

机译：基于音乐上下文和参数F0模型的歌唱语音合成的表达控制

A study of F0 modelling and generation with lyrics and shape characterization for singing voice synthesis

摘要

著录项

相似文献

相关主题

期刊订阅