...
首页> 外文期刊>Multimedia Tools and Applications >Leveraging topical and positional cues for language modeling in speech recognition
【24h】

Leveraging topical and positional cues for language modeling in speech recognition

机译:利用主题和位置提示进行语音识别中的语言建模

获取原文
获取原文并翻译 | 示例
           

摘要

This paper investigates language modeling with topical and positional information for large vocabulary continuous speech recognition. We first compare among a few topic models both theoretically and empirically, including document topic models and word topic models. On the other hand, since for some spoken documents such as broadcast news stories, the composition and the word usage of documents of the same style are usually similar, the documents hence can be separated into partitions consisting of identical rhetoric or topic styles by the literary structures, like introductory remarks, elucidations of methodology or affairs, conclusions of the articles, references or footnotes of reporters, etc. We hence present two position-dependent language models for speech recognition by integrating word positional information into the exiting n-gram and topic models. The experiments conducted on broadcast news transcription seem to indicate that such position-dependent models obtain comparable results to the existing «-gram and topic models.
机译:本文研究了具有主题和位置信息的语言建模,以用于大词汇量连续语音识别。我们首先在理论上和经验上比较几个主题模型,包括文档主题模型和单词主题模型。另一方面,由于对于诸如广播新闻报导之类的某些口头文件来说,相同样式的文件的构成和单词用法通常是相似的,因此,这些文件可以由文学者分成由相同的修辞或主题样式组成的分区。结构,例如引言,方法论或事务的说明,文章的结论,记者的参考文献或脚注等。因此,我们通过将单词位置信息整合到现有的n-gram和主题中,提出了两种基于位置的语言模型,用于语音识别。楷模。在广播新闻转录上进行的实验似乎表明,这种与位置相关的模型可以获得与现有«-gram和主题模型类似的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号