Robust Mixture Modeling Using T-Distribution: Application to Speaker ID

机译：使用T分布进行稳健的混合建模：应用于演讲者ID

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Robust stochastic modeling of speech is an important issue for the performance of practical applications. The Gaussian mixture model, GMM, is widely used in speaker ID, but its performance would get limited in the presence of unseen noise and distortions. Such noisy data, referred to as "outliers" for the original distribution, can be better represented by the use of heavy-tail distributions, such as Student's t-distribution. It provides a natural choice in which the heavy-tail can be controlled using the degrees-of-freedom parameter, v. We explore finite mixture of t-distributions model (tMM), to represent noisy speech data and show its robustness for speaker ID, compared to GMM. Using the TIMIT and NTIMIT databases, the recognition accuracy obtained are 100% and 79.68% with a 34 mixture tMM respectively much better than those reported in the literature.

机译：语音的鲁棒随机建模是实际应用性能的重要问题。高斯混合模型GMM被广泛用于扬声器ID，但是在存在看不见的噪声和失真的情况下其性能会受到限制。通过使用重尾分布（例如学生的t分布）可以更好地表示这种嘈杂的数据（对于原始分布称为“离群值”）。它提供了一种自然选择，其中可以使用自由度参数v控制重尾。我们探索t分布模型（tMM）的有限混合，以表示嘈杂的语音数据并显示其对说话者ID的鲁棒性，相比GMM。使用TIMIT和NTIMIT数据库，使用34种混合tMM分别获得的识别准确度分别为100％和79.68％，远胜于文献报道。

著录项

来源
《Annual conference of the International Speech Communication Association;INTERSPEECH 2010》|2011年|p.2758-2761|共4页
会议地点
作者
Harshavardhan. S; T. V. Sreenivas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类通信;
关键词
t mixture model (tMM); GMM; robustness to outliers;

机译：t混合模型（tMM）; GMM;异常值的鲁棒性;

相似文献

外文文献
中文文献
专利

1. Robust t-distribution mixture modeling via spatially directional information [J] . Taisong Xiong, Lei Zhang, Zhang Yi Neural computing & applications . 2014,第6期

机译：通过空间方向信息进行稳健的t分布混合模型
2. Robust mixture modelling using multivariate t-distribution with missing information [J] . Hai xian Wang, Quan bing Zhang, Bin Luo, Pattern recognition letters . 2004,第6期

机译：使用缺少信息的多元t分布进行稳健的混合建模
3. Robust text-independent speaker identification using Gaussian mixture speaker models [J] . Reynolds D.A., Rose R.C. IEEE Transactions on Speech and Audio Proceeding . 1995,第1期

机译：使用高斯混合说话人模型进行鲁棒的与文本无关的说话人识别
4. Robust Mixture Modeling Using T-Distribution: Application to Speaker ID [C] . Harshavardhan. S, T. V. Sreenivas Annual conference of the International Speech Communication Association . 2010

机译：使用T分布鲁棒混合建模：对扬声器ID的应用
5. Robust Speaker Modeling in Non-Neutral Environments with Application to Large Scale Multi-Speaker Audio Streams [D] . Yu, Chengzhu. 2017

机译：非中性环境中的鲁棒扬声器建模及其在大规模多扬声器音频流中的应用
6. Model-based spike sorting with a mixture of drifting t-distributions [O] . Kevin Q. Shan, Evgueniy V. Lubenov, Athanassios G. Siapas -1

机译：基于模型的尖峰排序与漂移t分布的混合
7. Robust mixture regression models using t-distribution [O] . Wei Yan 100

机译：使用t分布的稳健混合回归模型
8. Using the t-Distribution In Small Area Estimation; An Application to SAIPE State Poverty Models [R] . Huang, E. T., Bell, W. R. 2011

机译：在小面积估计中使用t分布; saIpE国家贫困模型的应用

Robust Mixture Modeling Using T-Distribution: Application to Speaker ID

摘要

著录项

相似文献

相关主题

期刊订阅