AUTOMATIC SPEAKER VERIFICATION EXPERIMENTS USING A CONTINUOUS SPEECH RECOGNIZER

机译：使用连续语音识别器自动扬声器验证实验

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper focuses on a special issue of biometrics – automatic speaker verification (ASV). There is a great interesting in developing and performance increasing of ASV applications because of the advantages offered comparing to other biometrical methods. The most important aspect is that such a speech processing application has a low implementation cost. State-of-the-art speaker recognizer are based on statistical models such as VQ, GMM or HMM. This work reports experiments on prompted text speaker verification on a Romanian corpus previously built. First, the continuous speech recognizer architecture is built at monophone level, context independent, single mixture. Then, two models are trained using appropriate data: a) Client model consisting of speaker dependent phonemes (SD) trained with few minutes of client's speech; and b) World model consisting of speaker independent phonemes (SI) trained with all sentences available in the database. Each phone has a two state left-right HMM with diagonal covariance matrices. The speaker verification system is textprompted as a sentence HMM is constructed for the key text by concatenating corresponding models. The normalized log-likelihood is computed and compared with a threshold to decide whether to accept or reject the speaker. In the verification stage, the normalized log-likelihood is computed by the difference between the log-likelihood obtained through Viterbi forced alignment of the client model and world model, respectively. Finally a procedure used to determine the verification system performances is presented, including FAR and FRR graphics vs. threshold, ROC curves and various criteria for threshold calibration.

机译：本文重点介绍了生物识别的特殊问题 - 自动扬声器验证（ASV）。由于与其他生物学方法相比，在ASV应用程序的开发和性能增加时，具有很大的兴趣。最重要的方面是这种语音处理应用程序具有低实现成本。最先进的扬声器识别器基于统计模型，例如VQ，GMM或HMM。这项工作报告了关于罗马尼亚语料库的提示文本扬声器验证的实验。首先，连续语音识别器架构建立在唯一的水平，上下文独立，单个混合物。然后，使用适当的数据训练两种模型：a）客户端模型，由讲话者依赖性音素（SD）组成，几分钟的客户的语音培训; b）由数据库中可用的所有句子培训的扬声器独立音素（si）组成的世界模型。每部手机都有两个左右HMM，具有对角协方差矩阵。扬声器验证系统是作为句子致句子的帖子，通过连接相应的模型来为关键文本构建。计算归一化的日志似然，并与阈值进行比较，以决定是否接受或拒绝扬声器。在验证阶段，通过分别通过客户模型和世界模型的Viterbi强制对准所获得的日志似然之间的差异来计算归一化的日志似然。最后提出了一种用于确定验证系统性能的过程，包括远程和FRR图形与阈值，ROC曲线和阈值校准的各种标准。

著录项

来源
《Conference on Speech Technology and Human-Computer Dialogue》|2007年||共10页
会议地点
作者
Doru MUNTEANU; Constantin PINTILIE; Laurentiu APOSTOL;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词
Biometric; Automatic speaker verification systems; Acoustic modeling; Hidden Markov models;

机译：生物统计;自动扬声器验证系统;声学造型;隐藏马尔可夫模型;

相似文献

外文文献
中文文献
专利

1. TAMEEM V1.0: speakers and text independent Arabic automatic continuous speech recognizer [J] . Mohammad A. M. Abushariah International journal of speech technology . 2017,第2期

机译：TAMEEM V1.0：独立于扬声器和文本的阿拉伯语自动连续语音识别器
2. Arabic Speaker-Independent Continuous Automatic Speech Recognition Based on a Phonetically Rich and Balanced Speech Corpus [J] . Mohammad Abushariah, Raja Ainon, Roziati Zainuddin, The international arab journal of information technology . 2012,第1期

机译：基于语音丰富均衡的语料库的阿拉伯语独立于说话人的连续自动语音识别
3. Semi-supervised speech activity detection with an application to automatic speaker verification [J] . Alexey Sholokhov, Md Sahidullah, Tomi Kinnunen Computer speech and language . 2018,第JANa期

机译：半监督语音活动检测及其在自动说话者验证中的应用
4. AUTOMATIC SPEAKER VERIFICATION EXPERIMENTS USING A CONTINUOUS SPEECH RECOGNIZER [C] . Doru MUNTEANU, Constantin PINTILIE, Laurentiu APOSTOL Conference on Speech Technology and Human-Computer Dialogue . 2007

机译：使用连续语音识别器自动扬声器验证实验
5. The use of discrete distributions with a very large codebook for automatic speech recognition and speaker verification. [D] . Ye, Guoli. 2013

机译：离散分布与非常大的密码本的配合使用可用于自动语音识别和说话者验证。
6. Speaker verification based on the fusion of speech acoustics and inverted articulatory signals [O] . Ming Li, Jangwon Kim, Adam Lammert, -1

机译：基于语音声学和反向发音信号融合的说话人验证
7. Automatic disfluency removal on recognized spontaneous speech - rapid adaptation to speaker dependent disfluencies [O] . Matthias Honal, Tanja Schultz 2005

机译：识别自发语音的自动不流畅消除 - 快速适应说话者依赖性不流利

AUTOMATIC SPEAKER VERIFICATION EXPERIMENTS USING A CONTINUOUS SPEECH RECOGNIZER

摘要

著录项

相似文献

相关主题

期刊订阅