首页> 外文学位 >Exploitation of effective temporal cues for lexical tone recognition of Chinese.
【24h】

Exploitation of effective temporal cues for lexical tone recognition of Chinese.

机译:开发有效的时态线索,用于汉语词汇音调识别。

获取原文
获取原文并翻译 | 示例

摘要

Lexical tone plays an important role in tonal languages. Acoustically, pitch is determined by the periodicity of speech, which is measured as the fundamental frequency (F0) of acoustic signals. In each tonal language, there are a certain number of lexical tones that are described by distinctive pitch contours. Cantonese and Mandarin have four and six tones, respectively.;People with sensorineural hearing loss have difficulty in utilizing spectral information for speech recognition and rely heavily on temporal information. The temporal information of speech is divided into three parts, based on the rate of amplitude fluctuation: temporal envelope (below 50 Hz), periodicity (50-500 Hz), and fine structure (above 500 Hz).;The goals of this thesis are to investigate what are the effective temporal cues for lexical tone perception of Chinese and how to manipulate or enhance these cues for better performance of tone perception. We adopt the research method of acoustic simulation with normal-hearing subjects. A four-channel noise-excited vocoder is used to generate test stimuli for tone identification.;We compare the contributions of temporal envelope and periodicity components (TEPCs) from different frequency regions to tone recognition in Cantonese and Mandarin. It is observed that TEPCs from high-frequency region (1-4 kHz) are more important than those from low-frequency region ( 1 kHz). In noise condition, tone recognition performance with temporal cues degrades and more spectral information is needed.;Previous studies show that hearing-impaired people have difficulties in perceiving tones, even though they are aided with cochlear implants (CIs). In this thesis, two approaches are investigated to improve Chinese tone recognition. In the first approach, TEPCs go through a process of non-linear expansion in order to increase the modulation depth of periodicity-related amplitude fluctuation. Results of listening tests show that TEPC expansion leads to a noticeable improvement on tone identification accuracy. In the second approach, the effectiveness of enhancing temporal periodicity cues in noise is investigated. Temporal periodicity cues are simplified into a sinusoidal wave with frequency equivalent to the F0 of speech. This leads to a consistent and significant improvement on tone identification performance at different noise levels. This part of research is expected to be helpful in designing CI processing strategy for effective speech perception of tonal languages.
机译:词汇语调在音调语言中起着重要的作用。听觉上,音调取决于语音的周期性,该周期性被测量为声学信号的基频(F0)。在每种音调语言中,都有一定数量的词汇音调,它们由独特的音高轮廓来描述。广东话和普通话分别有四个和六个音调。患有感音神经性听力丧失的人难以利用频谱信息进行语音识别,并且严重依赖时间信息。根据幅度波动的速率,将语音的时间信息分为三个部分:时间包络(50 Hz以下),周期性(50-500 Hz)和精细结构(500 Hz以上)。我们将研究什么是汉语词汇音调感知的有效时空线索,以及如何操纵或增强这些线索以更好地表现音调感知。我们采用正常听觉主体的声学模拟研究方法。使用四通道噪声激励声码器生成用于音调识别的测试刺激。我们比较了不同频率区域的时间包络和周期性分量(TEPC)对粤语和普通话音调识别的贡献。可以看出,来自高频区域(1-4 kHz)的TEPC比来自低频区域(<1 kHz)的TEPC更重要。在噪声条件下,具有时间提示的音调识别性能会下降,需要更多的频谱信息。;以前的研究表明,即使有耳蜗植入(CIs)的帮助,听力受损的人也难以感知音调。本文研究了两种方法来改善中文语音识别。在第一种方法中,TEPC经过非线性扩展过程,以增加与周期性相关的幅度波动的调制深度。听力测试的结果表明,TEPC扩展可显着提高音调识别的准确性。在第二种方法中,研究了增强噪声中的时间周期性提示的有效性。时间周期提示被简化为频率等于语音F0的正弦波。这导致在不同噪声水平下音调识别性能的一致性和显着提高。预期这部分研究将有助于设计CI处理策略,以有效地感知音调语言。

著录项

  • 作者

    Yuan, Meng.;

  • 作者单位

    The Chinese University of Hong Kong (Hong Kong).;

  • 授予单位 The Chinese University of Hong Kong (Hong Kong).;
  • 学科 Health Sciences Audiology.;Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2009
  • 页码 146 p.
  • 总页数 146
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

  • 入库时间 2022-08-17 11:38:24

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号