首页> 外国专利> DIMENSIONALITY REDUCTION OF BAUM-WELCH STATISTICS FOR SPEAKER RECOGNITION

DIMENSIONALITY REDUCTION OF BAUM-WELCH STATISTICS FOR SPEAKER RECOGNITION

机译:说话人识别的Baum-Welch统计量纲降维

摘要

In a speaker recognition apparatus, audio features are extracted from a received recognition speech signal, and first order Gaussian mixture model (GMM) statistics are generated therefrom based on a universal background model that includes a plurality of speaker models. The first order GMM statistics are normalized with regard to a duration of the received speech signal. The deep neural network reduces a dimensionality of the normalized first order GMM statistics, and outputs a voiceprint corresponding to the recognition speech signal
机译:在说话者识别装置中,基于包括多个说话者模型的通用背景模型,从接收到的识别语音信号中提取音频特征,并由此生成一阶高斯混合模型(GMM)统计信息。关于接收到的语音信号的持续时间,对一阶GMM统计进行归一化。深度神经网络降低了归一化GMM统计量的维数,并输出了与识别语音信号相对应的声纹

著录项

  • 公开/公告号WO2018053531A1

    专利类型

  • 公开/公告日2018-03-22

    原文格式PDF

  • 申请/专利权人 PINDROP SECURITY INC.;

    申请/专利号WO2017US52316

  • 发明设计人 KHOURY ELIE;GARLAND MATTHEW;

    申请日2017-09-19

  • 分类号G10L17/04;G10L17/18;

  • 国家 WO

  • 入库时间 2022-08-21 12:44:54

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号