Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models

Alfredo Maesa; Fabio Garzia; Michele Scarpiniti; Roberto Cusani

首页> 外文期刊>Journal of Information Security >Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models

【24h】

Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models

机译：基于Mel倒谱系数和高斯混合模型的文本独立说话人自动识别系统

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The aim of this paper is to show the accuracy and time results of a text independent automatic speaker recognition (ASR) system, based on Mel-Frequency Cepstrum Coefficients (MFCC) and Gaussian Mixture Models (GMM), in order to develop a security control access gate. 450 speakers were randomly extracted from the Voxforge.org audio database, their utterances have been improved using spectral subtraction, then MFCC were extracted and these coefficients were statistically analyzed by GMM in order to build each profile. For each speaker two different speech files were used: the first one to build the profile database, the second one to test the system performance. The accuracy achieved by the proposed approach is greater than 96% and the time spent for a single test run, implemented in Matlab language, is about 2 seconds on a common PC.

机译：本文的目的是展示基于梅尔倒谱倒谱系数（MFCC）和高斯混合模型（GMM）的文本独立自动说话人识别（ASR）系统的准确性和时间结果，以便开发安全控制检修门。从Voxforge.org音频数据库中随机提取了450个说话者，使用频谱相减法改善了他们的话语，然后提取了MFCC，并通过GMM对这些系数进行了统计分析，以建立每个配置文件。对于每个发言人，使用了两个不同的语音文件：第一个用于建立配置文件数据库，第二个用于测试系统性能。所提出的方法所达到的精度大于96％，并且在Matlab语言上实现的单次测试所花费的时间在普通PC上约为2秒。

著录项

来源
《Journal of Information Security》 |2012年第4期|共6页
作者
Alfredo Maesa; Fabio Garzia; Michele Scarpiniti; Roberto Cusani;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类安全保密;
关键词

相似文献

外文文献
中文文献
专利

1. Text-independent speaker identification system based on the histogram of DCT-cepstrum coefficients [J] . S. Al-Rawahy, A. Hossen, U. Heute International Journal of Knowledge-Based in Intelligent Engineering Systems . 2012,第3期

机译：基于DCT倒谱系数直方图的文本无关说话人识别系统
2. Robust text-independent speaker identification using Gaussian mixture speaker models [J] . Reynolds D.A., Rose R.C. IEEE Transactions on Speech and Audio Proceeding . 1995,第1期

机译：使用高斯混合说话人模型进行鲁棒的与文本无关的说话人识别
3. Analysis And Identification Of Emotion Specific Features For Speaker Independent Emotion Recognition System Using Gaussian Mixture Models (GMMs) [J] . J. Naga Padmaja, R. RajeswarRao Advances in computational sciences and technology . 2017,第8PTa2343a2506期

机译：基于高斯混合模型（GMM）的独立于说话人的情绪识别系统的情绪特定特征的分析和识别
4. Vector Quantization In Text Dependent Automatic Speaker Recognition Using Mel-frequency Cepstrum Coefficient [C] . AHSANUL KABIR, SHEIKH MOHAMMAD MASUDUL AHSAN WSEAS International Conferences . 2007

机译：矢量量化在文本依赖性自动扬声器识别中使用熔融频率综合扬声器识别
5. Mixtures of inverse covariances: Covariance modeling for Gaussian mixtures with applications to automatic speech recognition. [D] . Vanhoucke, Vincent. 2004

机译：逆协方差的混合：高斯混合的协方差建模及其在自动语音识别中的应用。
6. A Batch Rival Penalized Expectation-Maximization Algorithm for Gaussian Mixture Clustering with Automatic Model Selection [O] . Jiechang Wen, Dan Zhang, Yiu-ming Cheung, 2012

机译：具有自动模型选择的高斯混合聚类的批次竞争惩罚惩罚期望最大化算法
7. Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficients and Gaussian Mixture Models [O] . Alfredo Maesa, Fabio Garzia, Michele Scarpiniti, 2012

机译：基于Mel倒谱系数和高斯混合模型的文本独立自动说话人识别系统

Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models

摘要

著录项

相似文献

相关主题

期刊订阅