Threshold Re-weighting Attention Mechanism for Speaker Verification

机译：说话人验证的阈值重加权注意机制

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

It is difficult for the method of average pooling to get the optimal utterance-level features in applications of end-to-end speaker verification, because the importance of each frame is considered to be equivalent. A novel end-to-end architecture of ResCNN based on threshold re-weighting attention mechanism is proposed. Firstly, attention mechanism is introduced into the process of converting frame-level into utterance-level features to obtain the important frames then the larger weights are given by training. Secondly, the weights less than the average value of all weights are set to zero due to the fact that less speaker information is contained, and others are re-weighting to obtain new coefficients. Experimental results show that the equal error rate (EER) of the proposed method is 10.88% on the Voxceleb1 dataset, which is 1.41% lower than that of the average pooling method. This shows that the frames containing more speaker information can be selected by the proposed method more effectively, thus the performance of speaker verification system is improved. Furthermore, the extended experiment shows that the proposed method is also applicable for noisy scenes.

机译：由于平均帧的重要性被认为是等效的，因此平均池合并方法很难在端到端说话者验证应用中获得最佳话语级别特征。提出了一种新的基于阈值重加权注意机制的ResCNN端到端架构。首先，将注意力机制引入到将帧级特征转换为话语级特征以获得重要帧的过程中，然后通过训练给出更大的权重。其次，由于包含较少的说话者信息，因此将小于所有权重平均值的权重设置为零，并对其他权重进行重新加权以获得新系数。实验结果表明，该方法在Voxceleb1数据集上的均等错误率（EER）为10.88％，比平均合并方法低1.41％。这表明通过所提出的方法可以更有效地选择包含更多说话者信息的帧，从而提高了说话者验证系统的性能。此外，扩展实验表明，该方法也适用于嘈杂的场景。

著录项

来源
《2018 IEEE 4th Information Technology and Mechatronics Engineering Conference》|2018年|971-974|共4页
会议地点 Chongqing(CN)
作者
Bo Li; Xiaodong Cai;
展开▼
作者单位

School of Information and Communication, Guilin University of Electronic Technology, Guilin, 541004, China;

School of Information and Communication, Guilin University of Electronic Technology, Guilin, 541004, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Training; Noise measurement; Feature extraction; Conferences; Computer architecture; Speaker recognition; Speech processing;

机译：培训;噪声测量;特征提取;会议;计算机体系结构;说话人识别;语音处理;;

相似文献

外文文献
中文文献
专利

1. Dilated residual networks with multi-level attention for speaker verification [J] . Wu Yanfeng, Guo Chenkai, Gao Hongcan, Neurocomputing . 2020,第Octa28期

机译：扩展剩余网络，具有多级关注扬声器验证
2. Feature-Based Attentional Weighting and Re-weighting in the Absence of Visual Awareness [J] . Lasse Güldener, Antonia Jüllig, David Soto, Frontiers in Human Neuroscience . 2021,第5期

机译：在没有视觉意识的情况下，基于特征的注意力和重量重量
3. Deep built-structure counting in satellite imagery using attention based re-weighting [J] . Shakeel Anza, Sultani Waqas, Ali Mohsen ISPRS Journal of Photogrammetry and Remote Sensing . 2019,第MAY期

机译：使用基于注意力的重新加权在卫星图像中进行深层结构计数
4. Threshold Re-weighting Attention Mechanism for Speaker Verification [C] . Bo Li, Xiaodong Cai IEEE Information Technology and Mechatronics Engineering Conference . 2018

机译：扬声器验证的阈值重新加权注意机制
5. Discriminative and generative approaches for long- and short-term speaker characteristics modeling: Application to speaker verification. [D] . Dehak, Najim. 2009

机译：长期和短期说话者特征建模的判别和生成方法：在说话者验证中的应用。
6. Bidirectional Attention for Text-Dependent Speaker Verification [O] . Xin Fang, Tian Gao, Liang Zou, 2020

机译：文本依赖扬声器验证的双向关注
7. Speaker Verification Employing Combinations of Self-Attention Mechanisms [O] . Ara Bae, Wooil Kim 2020

机译：扬声器核查采用自我关注机制的组合

Threshold Re-weighting Attention Mechanism for Speaker Verification

摘要

著录项

相似文献

相关主题

期刊订阅