...
首页> 外文期刊>American journal of applied sciences >MODELING OF FUNDAMENTAL FREQUENCY CONTOURS FOR THAI DIALECTS WITH LARGE SPEECH DATABASE | Science Publications
【24h】

MODELING OF FUNDAMENTAL FREQUENCY CONTOURS FOR THAI DIALECTS WITH LARGE SPEECH DATABASE | Science Publications

机译:大型语音数据库的泰国方言基本频率轮廓建模科学出版物

获取原文
           

摘要

> In four core regions of Thailand, there are four main dialects including central, north, northeast and south dialects. The prosody is significantly unique for each dialect. One important factor determining the prosody is the fundamental frequency. As a result, modeling of Fundamental frequency (F0) contour is very important for the natural speech processing. Even though there are many modeling techniques for modeling the F0 contour. In this study, the Fujisaki?s model has been selected because of its achievement in modeling of various Thai speech units. This study proposes an analysis of model parameters of Thai speech prosody for four regional dialects and two genders. Seven derived parameters from the Fujisaki?s model are as follows. The first parameter is baseline frequency which is the lowest level of F0 contour. The second and third parameters are the numbers of phrase commands and tone commands which reflect the frequencies of surges of the utterance in global and local levels, respectively. The fourth and fifth parameters are phrase command and tone command durations which reflect the speed of speaking and the length of a syllable, respectively. The sixth and seventh parameters are amplitudes of phrase command and tone command which reflect the energy of the global speech and the energy of local syllable. In the experimental results, the large speech material of each regional dialect includes 50 samples of 50 sentences with male and female speech. It can be obviously seen that most of the proposed parameters can distinguish four kinds of regional dialects explicitly. The results reveal that the proposed parameters of Fujisaki?s model can distinguish the regional dialects explicitly.
机译: >在泰国的四个核心地区,主要有四种方言,包括中部,北部,东北和南部方言。每个方言的韵律都非常独特。决定韵律的一个重要因素是基频。因此,基本频率(F0)轮廓的建模对于自然语音处理非常重要。即使有许多用于对F0轮廓进行建模的建模技术。在这项研究中,选择了Fujisaki的模型是因为它在各种泰国语音单元的建模方面取得了成就。这项研究提出了四种区域性方言和两种性别的泰国语音韵律模型参数的分析。藤崎模型的七个导出参数如下。第一个参数是基线频率,它是F0轮廓的最低水平。第二和第三参数是短语命令和音调命令的数量,其分别反映整体和局部水平上的发音波动的频率。第四和第五参数是短语命令和音调命令的持续时间,它们分别反映说话的速度和音节的长度。第六和第七参数是短语命令和音调命令的幅度,其反映整体语音的能量和局部音节的能量。在实验结果中,每个地区方言的大型语音材料包括50个带有50种句子的男女语音样本。可以明显看出,大多数提出的参数可以明确地区分四种区域性方言。结果表明,所提出的藤崎模型参数可以明确地区性方言。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号