首页> 外文会议>Conference on Speech Technology and Human-Computer Dialogue >Fast training of Large Margin diagonal Gaussian mixture models for speaker identification
【24h】

Fast training of Large Margin diagonal Gaussian mixture models for speaker identification

机译:扬声器识别的大型裕度对角线高斯混合模型的快速训练

获取原文

摘要

Gaussian mixture models (GMM) have been widely and successfully used in speaker recognition during the last decades. They are generally trained using the generative criterion of maximum likelihood estimation. In an earlier work, we proposed an algorithm for discriminative training of GMM with diagonal covariances under a large margin criterion. In this paper, we present a new version of this algorithm which has the major advantage of being computationally highly efficient. The resulting algorithm is thus well suited to handle large scale databases. We carry out experiments on a speaker identification task using NIST-SRE'2006 data and compare our new algorithm to the baseline generative GMM using different GMM sizes. The results show that our system significantly outperforms the baseline GMM in all configurations, and with high computational efficiency.
机译:高斯混合模型(GMM)已广泛且成功地在过去几十年中成功地用于扬声器识别。 通常使用最大似然估计的生成标准训练它们。 在早期的工作中,我们提出了一种在大幅标准下具有对角线协方差的GMM的判别训练算法。 在本文中,我们提出了一种新版本的该算法,其具有计算上高效的主要优点。 因此,所得到的算法非常适合处理大规模数据库。 我们使用NIST-SRE'2006数据对扬声器识别任务进行实验,并将我们的新算法与基线生成GMM进行比较,使用不同的GMM尺寸。 结果表明,我们的系统在所有配置中显着优于基线GMM,并具有高计算效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号