An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models

机译：基于说话人聚类初始模型的在线增量说话人适应方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We previously proposed an incremental speaker adaptation method combined with automatic speaker-change detection for broadcast news transcription where speakers change frequently and each of them utters a series of several sentences. In this method, the speaker change is detected using speaker-independent and speaker-adaptive Gaussian mixture models (GMMs). Both phone HMMs and GMMs are incrementally adapted to each speaker by the combination of MLLR, MAP and VFS methods using speaker by the combination of MLLR, MAP and VFS methods using speaker-independent (SI) models as initial models. This paper proposes its improvement in which an initial model for speaker adaptation is selected from a set of models made by speaker clustering. Either cluster-dependent phone HMMs or GMMs are used to calculate the likelihood for selecting the best initial model. In a broadcast news transcription task, the proposed method significantly reduces word error rate compared with the method using SI-HMM as an initial model. Online incremental speaker adaptation results show that word errr rate is reduced by 11.6

机译：我们以前提出了一种增量说话人自适应方法，结合了自动说话人变化检测功能，用于广播新闻转录，其中说话人经常变化，每个说话人说出一系列的几个句子。在这种方法中，使用独立于说话者和自适应说话者的高斯混合模型（GMM）来检测说话者变化。通过使用扬声器的MLLR，MAP和VFS方法的组合，并且使用与扬声器无关的（SI）模型作为初始模型的MLLR，MAP和VFS的方法的组合，可以将电话HMM和GMM逐步适应每个扬声器。本文提出了一种改进，其中从说话者聚类所建立的一组模型中选择说话者适应的初始模型。取决于群集的电话HMM或GMM用于计算选择最佳初始模型的可能性。在广播新闻转录任务中，与使用SI-HMM作为初始模型的方法相比，该方法大大降低了单词错误率。在线增量说话人适应结果显示，单词错误率降低了11.6

著录项

来源
《6th International conference on Spoken Language Processing ICSLP 2000 Oct. 16-Oct.20 2000 Beijing International Convention Center, Beijing, China》|2000年|p.694-697|共4页
会议地点
作者
Zhipeng Zhang; Sadaoki Furui;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类世界各国文化与文化事业;
关键词

相似文献

外文文献
中文文献
专利

1. Bayesian Unsupervised Batch and Online Speaker Adaptation of Activation Function Parameters in Deep Models for Automatic Speech Recognition [J] . Zhen Huang, Sabato Marco Siniscalchi, Chin-Hui Lee Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2017,第1期

机译：用于语音识别的深度模型中激活函数参数的贝叶斯无监督批处理和在线说话者自适应
2. Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model [J] . Takafumi KOSHINAKA, Kentaro NAGATOMO, Koichi SHINODA IEICE transactions on information and systems . 2012,第10期

机译：使用增量学习的遍历隐马尔可夫模型进行在线说话人聚类
3. Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model [J] . Takafumi KOSHINAKA, Kentaro NAGATOMO, Koichi SHINODA IEICE Transactions on Information and Systems . 2012,第10期

机译：使用遍历隐马尔可夫模型的增量学习进行在线说话者聚类
4. An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models [C] . Zhipeng Zhang, Sadaoki Furui International conference on spoken language processing . 2000

机译：使用扬声器聚类初始模型的在线增量扬声器适配方法
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D] . Millington, Daniel S. 2011

机译：基于说话者特征的说话人识别系统声学模型自适应方法
6. Incremental Change or Initial Differences? Testing Two Models of Marital Deterioration [O] . Justin A. Lavner, Thomas N. Bradbury, Benjamin R. Karney -1

机译：增量变更或初始差异？测试两种婚姻恶化模型
7. Online Adaptation of Word-initial Ukrainian CC Consonant Clusters by Native Speakers of English [O] . Kateryna Laidler 2017

机译：英语母语讲话者在线适应词初初始乌克兰CC辅音群

An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models

摘要

著录项

相似文献

相关主题

期刊订阅