Improving PLDA speaker verification performance using domain mismatch compensation techniques

Md Hafizur Rahman; Ahilan Kanagasundaram; Ivan Himawan; David Dean; Sridha Sridharan

首页> 外文期刊>Computer speech and language >Improving PLDA speaker verification performance using domain mismatch compensation techniques

【24h】

Improving PLDA speaker verification performance using domain mismatch compensation techniques

机译：使用域失配补偿技术提高PLDA扬声器验证性能

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The performance of state-of-the-art i-vector speaker verification systems relies on a large amount of training data for probabilistic linear discriminant analysis (PLDA) modeling. During the evaluation, it is also crucial that the target condition data is matched well with the development data used for PLDA training. However, in many practical scenarios, these systems have to be developed, and trained, using data that is often outside the domain of the intended application, since the collection of a significant amount of in-domain data is often difficult. Experimental studies have found that PLDA speaker verification performance degrades significantly due to this development/evaluation mismatch. This paper introduces a domain-invariant linear discriminant analysis (DI-LDA) technique for out-domain PLDA speaker verification that compensates domain mismatch in the LDA sub-space. We also propose a domain-invariant probabilistic linear discriminant analysis (DI-PLDA) technique for domain mismatch modeling in the PLDA subspace, using only a small amount of in-domain data. In addition, we propose the sequential and score-level combination of DI-LDA, and DI-PLDA to further improve out-domain speaker verification performance. Experimental results show the proposed domain mismatch compensation techniques yield at least 27% and 14.5% improvement in equal error rate (EER) over a pooled PLDA system for telephone-telephone and interview-interview conditions, respectively. Finally, we show that the improvement over the baseline pooled system can be attained even when significantly reducing the number of in-domain speakers, down to 30 in most of the evaluation conditions.

机译：最新的i-vector说话者验证系统的性能依赖于大量的训练数据来进行概率线性判别分析（PLDA）建模。在评估期间，将目标条件数据与用于PLDA培训的开发数据进行良好匹配也至关重要。但是，在许多实际情况下，必须使用经常在预期应用程序范围之外的数据来开发和培训这些系统，因为通常很难收集大量域内数据。实验研究发现，由于这种开发/评估不匹配，PLDA说话人的验证性能会大大降低。本文介绍了一种用于域外PLDA说话人验证的域不变线性判别分析（DI-LDA）技术，该技术可补偿LDA子空间中的域失配。我们还提出了一种域不变概率线性判别分析（DI-PLDA）技术，用于PLDA子空间中的域失配建模，仅使用少量域内数据。此外，我们提出了DI-LDA和DI-PLDA的顺序和分数级别组合，以进一步提高域外说话者验证性能。实验结果表明，所提出的域失配补偿技术分别比电话电话和面试采访条件下的混合PLDA系统的均等错误率（EER）至少提高了27％和14.5％。最后，我们表明，即使大大减少了域内发言人的数量（在大多数评估条件下减少到30位），也可以实现对基线合并系统的改进。

著录项

来源
《Computer speech and language》 |2018年第1期|240-258|共19页
作者
Md Hafizur Rahman; Ahilan Kanagasundaram; Ivan Himawan; David Dean; Sridha Sridharan;
展开▼
作者单位

Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Speaker verification; I-vector; Domain mismatch compensation; DI-LDA; DI-PLDA; DI-PLDADI-LDA; Score fusion;

机译：说话者验证;载体域失配补偿;DI-LDA;DI-PLDA;DI-PLDA [DI-LDA];分数融合;

相似文献

外文文献
中文文献
专利

1. Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification [J] . Laskar Mohammad Azharuddin, Bhanja Chuya China, Laskar Rabul Hussain Circuits, systems and signal processing . 2021,第10期

机译：PLDA模型的扬声器 - 短语特定调整，提高文本依赖扬声器验证中的性能
2. Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification [J] . Osman Büyük Signal Processing, IET . 2016,第8期

机译：基于Sentence-HMM状态的i-vector / PLDA建模可提高与文本相关的单个说话者说话人验证的性能
3. Improving the performance of GPLDA speaker verification using unsupervised inter-dataset variability compensation approaches [J] . Ahilan Kanagasundaram International journal of speech technology . 2018,第3期

机译：使用无监督的数据集间可变性补偿方法提高GPLDA说话人验证的性能
4. Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach [C] . Kanagasundaram Ahilan, Dean David, Sridharan Sridha IEEE International Conference on Acoustics, Speech and Signal Processing . 2015

机译：使用无监督的数据集间可变性补偿方法改善域外PLDA说话人验证
5. Deep Neural Network Based Speaker Verification Under Domain Mismatched Conditions [D] . Zhang, Chunlei. 2019

机译：基于深度神经网络的扬声器验证在域不匹配条件下
6. Physics-aspects of dose accuracy in high dose rate (HDR) brachytherapy: source dosimetry treatment planning equipment performance and in vivo verification techniques [O] . Antony Palmer, David Bradley, Andrew Nisbet 2012

机译：高剂量率（HDR）近距离放射治疗中剂量准确性的物理方面：放射源剂量测定治疗计划设备性能和体内验证技术
7. Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach [O] . Kanagasundaram Ahilan, Dean David, Sridharan Sridha 2015

机译：使用无监督的数据集间可变性补偿方法改善域外PLDA说话人验证
8. DOMAIN MISMATCH COMPENSATION FOR SPEAKER RECOGNITION USING A LIBRARY OF WHITENERS. [R] . Singer, E., Reynolds, D. A. 2015

机译：利用白人图书馆进行演讲者识别的域名失调补偿。

Improving PLDA speaker verification performance using domain mismatch compensation techniques

摘要

著录项

相似文献

相关主题

期刊订阅