A Speaker Identification Method Based on DTW Algorithm

Abstract

Source file: AU2018102038A420190131.pdf

This invention is a speaker-recognition method, in the field of speech recognition technology, based on the Dynamic Time Warping (DTW) algorithm. The method consists of several steps. First, the speakers' audio signals are pre-processed using normalization, detection of the effective (speech-bearing) audio segments, Mel Frequency Cepstral Coefficient (MFCC) feature extraction, and so on. Second, the processed signals are divided into a training set and a test set. Third, the DTW algorithm is used to measure the distance between the amplitudes of every pair of signals, one taken from each set. Finally, the speaker whose signals occur most often among the K shortest distances is identified as the test speaker, which completes the identification. The invention requires no participation or adjustment by the tester and performs feature extraction and recognition automatically, providing a reliable and fast DTW-based speaker identification technique.

Fig. 5 Process diagram of the MFCC algorithm

Fig. 6 The signal features extracted by MFCC (horizontal axis: frame number)
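The matching stage described in the abstract (DTW distances followed by a vote over the K shortest) can be illustrated with a minimal Python sketch. This is not the patent's implementation. The function names `dtw_distance` and `identify_speaker`, the Euclidean per-frame distance, the default `k=3`, and the toy one-dimensional "feature" sequences are all assumptions made for illustration. A real system would feed in MFCC frame vectors produced by the pre-processing steps.

```python
import math
from collections import Counter


def dtw_distance(a, b):
    """Classic DTW distance between two sequences of feature vectors.

    Each sequence is a list of frames; each frame is a list of floats.
    The per-frame cost is Euclidean distance (an assumption; the abstract
    does not name the local distance measure).
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            # Allowed steps: insertion, deletion, or match
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def identify_speaker(test_seq, training_set, k=3):
    """Return the label seen most often among the k nearest training samples.

    training_set is a list of (speaker_label, sequence) pairs.
    """
    distances = sorted(
        (dtw_distance(test_seq, seq), label) for label, seq in training_set
    )
    votes = Counter(label for _, label in distances[:k])
    # On a tied vote, Counter keeps first-seen order, so the nearer label wins
    return votes.most_common(1)[0][0]


# Toy data: hypothetical 1-D "features" standing in for MFCC frames
training = [
    ("alice", [[0.0], [1.0], [2.0], [1.0]]),
    ("alice", [[0.0], [1.0], [1.0], [2.0], [1.0]]),
    ("bob", [[5.0], [6.0], [5.0]]),
    ("bob", [[5.0], [5.0], [6.0], [6.0], [5.0]]),
]
test = [[0.0], [0.0], [1.0], [2.0], [1.0]]  # time-stretched variant of alice

result = identify_speaker(test, training, k=3)
print(result)  # alice
```

Because DTW allows frames to be repeated during alignment, the time-stretched test sequence matches both of alice's samples at zero cost even though the lengths differ, which is the property that makes DTW suitable for utterances spoken at different speeds.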
