首页> 外文会议>Annual conference of the International Speech Communication Association;INTERSPEECH 2010 >Robust Mixture Modeling Using T-Distribution: Application to Speaker ID
【24h】

Robust Mixture Modeling Using T-Distribution: Application to Speaker ID

机译:使用T分布进行稳健的混合建模:应用于演讲者ID

获取原文

摘要

Robust stochastic modeling of speech is an important issue for the performance of practical applications. The Gaussian mixture model, GMM, is widely used in speaker ID, but its performance would get limited in the presence of unseen noise and distortions. Such noisy data, referred to as "outliers" for the original distribution, can be better represented by the use of heavy-tail distributions, such as Student's t-distribution. It provides a natural choice in which the heavy-tail can be controlled using the degrees-of-freedom parameter, v. We explore finite mixture of t-distributions model (tMM), to represent noisy speech data and show its robustness for speaker ID, compared to GMM. Using the TIMIT and NTIMIT databases, the recognition accuracy obtained are 100% and 79.68% with a 34 mixture tMM respectively much better than those reported in the literature.
机译:语音的鲁棒随机建模是实际应用性能的重要问题。高斯混合模型GMM被广泛用于扬声器ID,但是在存在看不见的噪声和失真的情况下其性能会受到限制。通过使用重尾分布(例如学生的t分布)可以更好地表示这种嘈杂的数据(对于原始分布称为“离群值”)。它提供了一种自然选择,其中可以使用自由度参数v控制重尾。我们探索t分布模型(tMM)的有限混合,以表示嘈杂的语音数据并显示其对说话者ID的鲁棒性,相比GMM。使用TIMIT和NTIMIT数据库,使用34种混合tMM分别获得的识别准确度分别为100%和79.68%,远胜于文献报道。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号