首页> 外文期刊>International journal of computer processing of languages >Research of Vowel Mapping Theory for Speaker Identification of Chinese Mandarin
【24h】

Research of Vowel Mapping Theory for Speaker Identification of Chinese Mandarin

机译:元音映射理论在汉语普通话说话者识别中的研究

获取原文
获取原文并翻译 | 示例
           

摘要

This paper presents a novel framework of speaker identification based on Chinese vowel mapping theory. The base of this framework is the mapping from Chinese multi-phthong to monophthong phonemes. According to the contrast of spectrum, features, monophthong phoneme glide statistical distribution and the performance of vowel classification, we confirmed that Chinese vowels could be decomposed into several monophthong phonemes based on the short time characteristics. Then we developed a new mapping table from multi-phthong to monophthong phoneme to assist the latter research of a great deal of experiment and theory analysis. The novel framework added a special model to implement the decomposition and organized several monophthong classifiers to replace the traditional classification module, which can avoid the disturbance of semantic information and achieve higher performance. In this new framework, it adopts short time frame as the basic identify unit, which makes it more compatible to real time system. The sufficient theory analysis and experimental results showed that the presented model and algorithms based on novel framework have achieved higher accuracy, speed and enhanced the robustness in different conditions compared with many traditional methods. Especially, we succeed in separating personal identification information from semantic information based on classifying the Chinese vowel, which will be a new way to transform the text-independent system into text-dependent speaker recognition system.
机译:本文提出了一种基于汉语元音映射理论的说话人识别新框架。该框架的基础是从中国多声副音到单声副音素的映射。根据频谱,特征,单音素音滑度统计分布和元音分类性能的对比,我们证实了基于短时特征的汉语元音可以分解为几种单音素音素。然后,我们开发了一个新的映射表,从多phthong到monophthong音素,以协助后面的大量实验和理论分析研究。新的框架增加了一个特殊的模型来实现分解,并组织了多个单音分类器来代替传统的分类模块,从而避免了语义信息的干扰,并获得了更高的性能。在这个新的框架中,它采用短时间框架作为基本的识别单元,这使其与实时系统更加兼容。足够的理论分析和实验结果表明,与许多传统方法相比,所提出的基于新颖框架的模型和算法在不同条件下具有更高的精度,速度和更高的鲁棒性。尤其是,我们在对汉语元音进行分类的基础上成功地将个人识别信息从语义信息中分离出来,这将成为一种将文本无关系统转变为文本相关说话人识别系统的新途径。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号