A Study on Universal Background Model Training in Speaker Verification

Hasan T.; Hansen J. H. L.

首页> 外文期刊>Audio, Speech, and Language Processing, IEEE Transactions on >A Study on Universal Background Model Training in Speaker Verification

【24h】

A Study on Universal Background Model Training in Speaker Verification

机译：说话人验证中的通用背景模型训练研究

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

State-of-the-art Gaussian mixture model (GMM)-based speaker recognition/verification systems utilize a universal background model (UBM), which typically requires extensive resources, especially if multiple channel and microphone categories are considered. In this study, a systematic analysis of speaker verification system performance is considered for which the UBM data is selected and purposefully altered in different ways, including variation in the amount of data, sub-sampling structure of the feature frames, and variation in the number of speakers. An objective measure is formulated from the UBM covariance matrix which is found to be highly correlated with system performance when the data amount was varied while keeping the UBM data set constant, and increasing the number of UBM speakers while keeping the data amount constant. The advantages of feature sub-sampling for improving UBM training speed is also discussed, and a novel and effective phonetic distance-based frame selection method is developed. The sub-sampling methods presented are shown to retain baseline equal error rate (EER) system performance using only 1% of the original UBM data, resulting in a drastic reduction in UBM training computation time. This, in theory, dispels the myth of “There''s no data like more data” for the purpose of UBM construction. With respect to the UBM speakers, the effect of systematically controlling the number of training (UBM) speakers versus overall system performance is analyzed. It is shown experimentally that increasing the inter-speaker variability in the UBM data while maintaining the overall total data size constant gradually improves system performance. Finally, two alternative speaker selection methods based on different speaker diversity measures are presented. Using the proposed schemes, it is shown that by selecting a diverse set of UBM speakers, the baseline system performance can be retained using less than 30% of the original UBM speakers.

机译：基于最新高斯混合模型（GMM）的说话人识别/验证系统利用通用背景模型（UBM），通常需要大量资源，尤其是在考虑多个通道和麦克风类别的情况下。在这项研究中，考虑了对说话人验证系统性能的系统分析，为此选择了UBM数据并以不同的方式有针对性地对其进行了更改，包括数据量的变化，特征帧的子采样结构以及数量的变化。扬声器。根据UBM协方差矩阵制定了一个客观指标，发现该指标与系统性能高度相关，当改变数据量的同时保持UBM数据集不变，并增加UBM说话者的人数，同时保持数据量不变。还讨论了特征子采样对提高UBM训练速度的优势，并开发了一种新颖有效的基于语音距离的帧选择方法。显示的子采样方法仅使用原始UBM数据的1％即可保持基线均等错误率（EER）系统性能，从而大大减少了UBM训练计算时间。从理论上讲，这消除了出于UBM构建的目的“没有数据就像更多数据”的神话。对于UBM扬声器，分析了系统控制培训（UBM）扬声器数量与整体系统性能的影响。实验表明，在保持整体总数据大小不变的同时，增加UBM数据中的说话者间差异会逐步改善系统性能。最后，提出了两种基于不同说话人多样性测度的说话人选择方法。使用所提出的方案表明，通过选择多样化的UBM扬声器集，可以使用不到30％的原始UBM扬声器保持基线系统性能。

著录项

来源
《Audio, Speech, and Language Processing, IEEE Transactions on》 |2011年第7期|p.1890-1899|共10页
作者
Hasan T.; Hansen J. H. L.;
展开▼
作者单位

Center for Robust Speech Systems (CRSS), Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, TX, USA;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Acoustic modeling; intelligent speaker selection; speaker recognition; speaker verification; universal background model (UBM);

机译：声学建模;说话人智能选择;说话人识别;说话人验证;通用背景模型（UBM）;

相似文献

外文文献
中文文献
专利

1. OPTIMAL UNIVERSAL BACKGROUND MODEL IN AUTOMATIC SPEAKER VERIFICATION [J] . Hayet Djellali, Mohamed Tayeb Laskri Computers & Structures . 2013,第2期

机译：自动扬声器验证中的最佳通用背景模型
2. Towards an Optimal Speaker Modeling in Speaker Verification Systems using Personalized Background Models [J] . Ayoub Bouziane, Jamal Kharroubi, Arsalane Zarghili International Journal of Electrical and Computer Engineering . 2017,第6期

机译：使用个性化背景模型实现说话人验证系统中的最佳说话人建模
3. Speaker Model Clustering to Construct Background Models for Speaker Verification [J] . Disken Gokay, Tufekci Zekeriya, Cevik Ulus Archives of acoustics . 2017,第1期

机译：说话人模型聚类为说话人验证构建背景模型
4. A novel feature sub-sampling method for efficient universal background model training in speaker verification [C] . Hasan, Taufiq, Lei, Yun, Chandrasekaran, Aravind, IEEE International Conference on Acoustics Speech and Signal;ICASSP 2010 . 2010

机译：一种有效的说话人验证通用背景模型训练的特征子采样方法
5. Reducing computation in speaker recognition systems using a tree-structured universal background model. [D] . McClanahan, Richard Daniel. 2014

机译：使用树型通用背景模型来减少说话人识别系统中的计算。
6. Classification of ADHD and Non-ADHD Subjects Using a Universal Background Model [O] . Juan Lopez Marcano, Martha Ann Bell, A. A. (Louis) Beex -1

机译：使用通用背景模型对ADHD和非ADHD受试者进行分类
7. Towards an Optimal Speaker Modeling in Speaker Verification Systems using Personalized Background Models [O] . Ayoub Bouziane, Jamal Kharroubi, Arsalane Zarghili 2017

机译：朝着使用个性化背景模型的扬声器验证系统中的最佳扬声器建模
8. Tests Results Advanced Development Models of BISS Identity Verification Equipment. Volume II. Automatic Speaker Verification. [R] . foodman,martin j. 1978

机译：测试结果BIss身份验证设备的高级开发模型。第二卷。自动扬声器验证。

A Study on Universal Background Model Training in Speaker Verification

摘要

著录项

相似文献

相关主题

期刊订阅