【24h】

An Online Incremental Speaker Adaptation Method Using Speaker-Clustered Initial Models

机译:基于说话人聚类初始模型的在线增量说话人适应方法

获取原文

摘要

We previously proposed an incremental speaker adaptation method combined with automatic speaker-change detection for broadcast news transcription where speakers change frequently and each of them utters a series of several sentences. In this method, the speaker change is detected using speaker-independent and speaker-adaptive Gaussian mixture models (GMMs). Both phone HMMs and GMMs are incrementally adapted to each speaker by the combination of MLLR, MAP and VFS methods using speaker by the combination of MLLR, MAP and VFS methods using speaker-independent (SI) models as initial models. This paper proposes its improvement in which an initial model for speaker adaptation is selected from a set of models made by speaker clustering. Either cluster-dependent phone HMMs or GMMs are used to calculate the likelihood for selecting the best initial model. In a broadcast news transcription task, the proposed method significantly reduces word error rate compared with the method using SI-HMM as an initial model. Online incremental speaker adaptation results show that word errr rate is reduced by 11.6
机译:我们以前提出了一种增量说话人自适应方法,结合了自动说话人变化检测功能,用于广播新闻转录,其中说话人经常变化,每个说话人说出一系列的几个句子。在这种方法中,使用独立于说话者和自适应说话者的高斯混合模型(GMM)来检测说话者变化。通过使用扬声器的MLLR,MAP和VFS方法的组合,并且使用与扬声器无关的(SI)模型作为初始模型的MLLR,MAP和VFS的方法的组合,可以将电话HMM和GMM逐步适应每个扬声器。本文提出了一种改进,其中从说话者聚类所建立的一组模型中选择说话者适应的初始模型。取决于群集的电话HMM或GMM用于计算选择最佳初始模型的可能性。在广播新闻转录任务中,与使用SI-HMM作为初始模型的方法相比,该方法大大降低了单词错误率。在线增量说话人适应结果显示,单词错误率降低了11.6

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号