首页> 中文期刊> 《计算机技术与发展》 >基于压缩感知的鲁棒性说话人识别参数研究

基于压缩感知的鲁棒性说话人识别参数研究

         

摘要

奈奎斯特采样下的说话人识别,当为了确保高的识别率而采集较长时间说话人语音时,采样数据量特别大,其中有许多冗余造成了采样资源的浪费,压缩感知理论可以很好地解决此问题。基于压缩感知理论,文中利用行阶梯观测矩阵对信号进行投影,研究了压缩比与识别率的关系,在压缩比为1:2时,保证识别率的同时,使得采样数据量减少为原来的一半。在有噪环境下,将谱减法运用到压缩感知和特征提取过程中,在无需重构时域信号的前提下,直接从已估计的干净语音功率谱中提取具有鲁棒性的特征参数CS-SSMFCC( Compressed Sensing Spectral Subtraction Mel Frequency Cepstral Co-efficient)。实验结果表明,与传统的识别参数MFCC( Mel Frequency Cepstral Coefficient)相比,CS-SSMFCC可以有效地提高系统的鲁棒性,具有很好的抗噪性能。%Speaker recognition under Nyquist sampling has got a large amount of data in order to ensure a high recognition rate,resulting in a waste of sampling resources,and compressive sensing theory can solve this problem. Based on compressed sensing theory,it makes use of ladder observation matrix projection in this paper. When the compression ratio is 1:2,the system ensures the recognition rate,so that the sample data is reduced to half. Under noisy environment,spectral subtraction is applied in compressed sensing and feature extrac-tion,and feature parameters are extracted directly from estimated clean speech power spectrum CS-SSMFCC (Compressed Sensing Spec-tral Subtraction Mel Frequency Cepstral Coefficient) . Experimental results show that compared with the traditional identification parame-ter MFCC (Mel frequency Cepstral Coefficient),CS-SSMFCC based on spectral subtraction under CS framework can effectively im-prove the robustness of the system,with good anti-noise performance.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号