首页> 外文期刊>IEICE Transactions on Information and Systems >Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition
【24h】

Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

机译:演讲者语音识别的无监督演讲者自适应模型

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (GSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.
机译:本文针对LVCSR系统提出了一种新的说话人分类模型及其适应方法,并对自发性日本语料库(GSJ)进行了评估。在这种方法中,从训练说话者中选择更近的说话者,并通过对每个评估说话者使用其发声来训练声学模型。扬声器类模型的主要问题之一是确定扬声器的选择范围。为了解决该问题,预先为每个评估说话者准备具有不同说话者范围的几个模型,并且在识别步骤中基于似然性选择最合适的模型。此外,我们通过对说话人类别的模型进行无监督的说话人自适应来提高识别性能。在识别实验中,与传统的自适应方法相比,通过使用基于说话者分类模型的拟议的说话者自适应,可以获得明显的改善。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号