F_0 contour generation and synthesis using Bengali HMM-based speech synthesis system

Sankar Mukherjee; Shyamal Kumar Das Mandal

Abstract

The HMM-based Bengali speech synthesis system (Bengali-HTS) generates highly intelligible synthesized speech, but its naturalness is inadequate even though it is trained on a large speech corpus. For interrogative, imperative and exclamatory sentences, the naturalness of the synthesized speech falls drastically. This paper proposes a method to overcome this problem by modifying the F_0 contour of the synthetic speech based on the Fujisaki model. The Fujisaki model features of different types of Bengali sentences are analyzed for F_0 contour generation. These features depend on the prosodic word and phrase boundaries of the sentence, so a two-layer supervised classification and regression tree is trained to predict those boundaries. The Fujisaki model then generates the F_0 contour from input text using the predicted prosodic word/phrase boundaries and the segmental duration information from the HMM-based speech synthesis system. Moreover, for HMM training, the prosodic structure of the sentence is used rather than its lexical structure. MOS and preference tests show that the proposed method significantly improves the overall quality of the synthesized speech over that of Bengali-HTS.
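As a rough illustration of the Fujisaki model mentioned in the abstract, the sketch below computes an F_0 contour in its standard textbook form: a baseline frequency plus phrase and accent components summed in the log domain. It is not the authors' implementation. The function names, the default constants (alpha = 3.0/s, beta = 20.0/s, gamma = 0.9, which are commonly cited values) and the example command timings and magnitudes are all assumptions made here for illustration; in the paper, command placement would follow from the predicted prosodic word/phrase boundaries and the segmental durations.

```python
import math


def phrase_response(t, alpha=3.0):
    """Impulse response of the phrase control mechanism, Gp(t)."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=20.0, gamma=0.9):
    """Step response of the accent control mechanism, Ga(t), clipped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def fujisaki_f0(times, fb, phrase_cmds, accent_cmds,
                alpha=3.0, beta=20.0, gamma=0.9):
    """Return F_0 in Hz at each time (seconds) in `times`.

    fb          -- baseline frequency in Hz
    phrase_cmds -- list of (onset_s, magnitude) impulses
    accent_cmds -- list of (onset_s, offset_s, amplitude) pedestals
    """
    contour = []
    for t in times:
        # The components add in the log-frequency domain.
        log_f0 = math.log(fb)
        for onset, magnitude in phrase_cmds:
            log_f0 += magnitude * phrase_response(t - onset, alpha)
        for onset, offset, amplitude in accent_cmds:
            log_f0 += amplitude * (
                accent_response(t - onset, beta, gamma)
                - accent_response(t - offset, beta, gamma)
            )
        contour.append(math.exp(log_f0))
    return contour


# Hypothetical commands (not taken from the paper): one phrase command at
# the utterance start and one accent command spanning 0.3 s to 0.6 s.
times = [i * 0.01 for i in range(200)]
f0 = fujisaki_f0(times, fb=100.0,
                 phrase_cmds=[(0.0, 0.5)],
                 accent_cmds=[(0.3, 0.6, 0.4)])
```

Because both response functions are non-negative and the accent step response never decreases, the contour in this sketch never drops below the baseline `fb`. Sentence types such as questions or exclamations would be modelled by different command magnitudes and timings.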

Bibliographic details

Source
International Journal of Speech Technology | 2015, Issue 1 | pp. 25-36 | 12 pages
Authors
Sankar Mukherjee; Shyamal Kumar Das Mandal
Affiliations

Centre for Educational Technology, Indian Institute of Technology Kharagpur, Kharagpur 721302, West Bengal, India (both authors)

Original format: PDF
Language: English
Keywords
F_0 contour modification; Bengali-HTS; Fujisaki model; Prosodic word prediction; Prosodic phrase prediction; Speech synthesis
