A speaker verification backend with robust performance across conditions

Luciana Ferrer; Mitchell McLaren; Niko Brümmer

Abstract

In this paper, we address the problem of speaker verification in conditions unseen or unknown during development. A standard method for speaker verification consists of extracting speaker embeddings with a deep neural network and processing them through a backend composed of probabilistic linear discriminant analysis (PLDA) and global logistic regression score calibration. This method is known to result in systems that work poorly on conditions different from those used to train the calibration model. We propose to modify the standard backend, introducing an adaptive calibrator that uses duration and other automatically extracted side-information to adapt to the conditions of the inputs. The backend is trained discriminatively to optimize binary cross-entropy. When trained on a number of diverse datasets that are labeled only with respect to speaker, the proposed backend consistently and, in some cases, dramatically improves calibration, compared to the standard PLDA approach, on a number of held-out datasets, some of which are markedly different from the training data. Discrimination performance is also consistently improved. We show that joint training of the PLDA and the adaptive calibrator is essential - the same benefits cannot be achieved when freezing PLDA and fine-tuning the calibrator. To our knowledge, the results in this paper are the first evidence in the literature that it is possible to develop a speaker verification system with robust out-of-the-box performance on a large variety of conditions.
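The contrast between global and adaptive calibration described in the abstract can be illustrated with a minimal sketch. This record does not give the paper's actual parametrization, so everything below is an assumption made for illustration: the adaptive calibrator is taken to have a scale and offset that are affine in log-duration, the raw scores are synthetic stand-ins for PLDA scores, and the helper names (`global_features`, `adaptive_features`, `fit`, `bce`) are hypothetical. The joint training of PLDA and calibrator that the abstract calls essential is not reproduced; only the calibration stage is fitted, by plain gradient descent on an unweighted binary cross-entropy.

```python
import math
import random

def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)

def global_features(score, duration):
    # Global calibration: llr = a * score + b, same (a, b) for every trial.
    return [score, 1.0]

def adaptive_features(score, duration):
    # Assumed adaptive form (not taken from the paper):
    # llr = (a0 + a1 * ld) * score + (b0 + b1 * ld), with ld = log(duration).
    ld = math.log(duration)
    return [score, 1.0, score * ld, ld]

def llr(weights, feats):
    """Calibrated log-likelihood ratio as a linear function of the features."""
    return sum(w * f for w, f in zip(weights, feats))

def bce(weights, X, y):
    """Mean binary cross-entropy at a target prior of 0.5."""
    return sum(softplus(-llr(weights, x)) if t else softplus(llr(weights, x))
               for x, t in zip(X, y)) / len(y)

def fit(X, y, init, steps=200):
    """Gradient descent on the BCE. The step size 1/L uses the bound
    L <= 0.25 * mean squared feature norm, so the loss cannot increase."""
    w = list(init)
    lr = 4.0 / (sum(f * f for x in X for f in x) / len(X))
    for _ in range(steps):
        grad = [0.0] * len(w)
        for x, t in zip(X, y):
            err = sigmoid(llr(w, x)) - t  # d(BCE)/d(llr) for this trial
            for j, f in enumerate(x):
                grad[j] += err * f / len(X)
        w = [wj - lr * gj for wj, gj in zip(w, grad)]
    return w

# Synthetic trials (assumption): score separation grows with log-duration,
# so a single global scale cannot be right for all durations.
random.seed(0)
trials = []
for _ in range(400):
    dur = random.choice([2.0, 5.0, 20.0, 60.0])  # seconds
    target = random.randint(0, 1)                # 1 = same speaker
    mean = (1.0 if target else -1.0) * math.log(dur)
    trials.append((random.gauss(mean, 1.0), dur, target))

y = [t for _, _, t in trials]
Xg = [global_features(s, d) for s, d, _ in trials]
Xa = [adaptive_features(s, d) for s, d, _ in trials]

w_global = fit(Xg, y, init=[0.0, 0.0])
# Warm-start the adaptive model from the global solution (a1 = b1 = 0).
w_adaptive = fit(Xa, y, init=w_global + [0.0, 0.0])

loss_global = bce(w_global, Xg, y)
loss_adaptive = bce(w_adaptive, Xa, y)
print(f"global BCE:   {loss_global:.4f}")
print(f"adaptive BCE: {loss_adaptive:.4f}")
```

Because the adaptive model is warm-started from the global solution and the step size is chosen conservatively, its training loss cannot end up above the global one. Whether such a gain carries over to held-out conditions, which is the paper's actual claim, is not something this toy example can show.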

Bibliographic details

Source
Computer Speech and Language | 2022, Issue 1 | 101258.1-101258.23 | 23 pages
Authors
Luciana Ferrer; Mitchell McLaren; Niko Brümmer;
Author affiliations

Instituto de Investigación en Ciencias de la Computación (ICC), CONICET-UBA, Argentina;

Speech Technology and Research Lab (StarLab), SRI International, USA;

Phonexia, South Africa;

Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
Original format: PDF
Language: English
Keywords
Speaker verification; Probabilistic linear discriminant analysis; Robust calibration;

