Journal of the Royal Statistical Society

The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages


Abstract

The historical and geographical spread from older to more modern languages has long been studied by examining textual changes and changes in phonetic transcriptions. However, it is more difficult to analyse language change from an acoustic point of view, although speech is usually the dominant mode of transmission. We propose a novel approach to the analysis of acoustic phonetic data, which aims to model the acoustic properties of spoken words statistically. We explore phonetic variation and change by using a time-frequency representation, namely the log-spectrograms of speech recordings. We identify time and frequency covariance functions as a feature of the language; in contrast, mean spectrograms depend mostly on the particular word that has been uttered. We build models for the mean and covariances (taking into account the restrictions placed on the statistical analysis of such objects) and use these to define a phonetic transformation that models how an individual speaker would sound in a different language, allowing the exploration of phonetic differences between languages. Finally, we map these transformations back to the domain of sound recordings, enabling us to listen to the output of the statistical analysis. The proposed approach is demonstrated by using recordings of the words corresponding to the numbers from 1 to 10 as pronounced by speakers of five different Romance languages.
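
To make the objects named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It computes log-spectrograms with a plain NumPy short-time Fourier transform, then forms a mean spectrogram and simple marginal covariance estimates along the frequency and time axes. The function names, the frame length and hop size, the averaging estimator, and the synthetic white-noise "recordings" are all assumptions introduced for illustration. The abstract does not specify how the covariances are estimated or how recordings are aligned in time.

```python
import numpy as np


def log_spectrogram(signal, frame_len=256, hop=128, eps=1e-10):
    """Log power spectrogram from a Hann-windowed short-time Fourier transform.

    Returns an array of shape (n_freq, n_frames). frame_len, hop and eps are
    illustrative choices, not values taken from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    return np.log(power + eps).T


def mean_and_marginal_covariances(spectrograms):
    """Mean surface plus marginal frequency and time covariance estimates.

    spectrograms: array of shape (n_rec, n_freq, n_time), assumed to be
    recordings of the same word already aligned on a common grid.
    This simple averaging estimator is an assumption of this sketch.
    """
    mean = spectrograms.mean(axis=0)
    resid = spectrograms - mean
    n_rec, n_freq, n_time = resid.shape
    # Frequency covariance: average residual outer products over time frames.
    freq_cov = np.einsum("rft,rgt->fg", resid, resid) / ((n_rec - 1) * n_time)
    # Time covariance: average residual outer products over frequency bins.
    time_cov = np.einsum("rft,rfs->ts", resid, resid) / ((n_rec - 1) * n_freq)
    return mean, freq_cov, time_cov


# Synthetic stand-in for speech: five white-noise "recordings" of equal length.
rng = np.random.default_rng(0)
recordings = rng.standard_normal((5, 2048))
specs = np.stack([log_spectrogram(x) for x in recordings])
mean, freq_cov, time_cov = mean_and_marginal_covariances(specs)
print(specs.shape, mean.shape, freq_cov.shape, time_cov.shape)
```

With real data, the white-noise arrays would be replaced by recordings of the same word from speakers of one language, so that the mean surface reflects the word and the two covariance matrices summarise variation across speakers. Both estimates are symmetric and positive semi-definite by construction, which is the kind of restriction on covariance objects that the abstract alludes to.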