首页> 外文会议>Proceedings of the speech recognition workshop >THE DEVELOPMENT OF THE 1996 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM
【24h】

THE DEVELOPMENT OF THE 1996 HTK BROADCAST NEWS TRANSCRIPTION SYSTEM

机译:1996年HTK广播新闻报道系统的开发

获取原文
获取原文并翻译 | 示例

摘要

This paper describes our efforts in extending a large vocabulary speech recognition system to handle broadcast news transcription. Results using the 1995 DARPA H4 evaluation data set are presented for different front-end analyses and for the use of unsuper-vised model adaptation using maximum likelihood linear regression (MLLR). The HTK system for the 1996 H4 evaluation is then described. It includes a number of new features compared to previous HTK large vocabulary systems including decoder-guided segmentation, segment clustering, cache-based language modelling, and combined MAP and MLLR adaptation. The system makes multiple passes through the data and the detailed results of each pass are given. The overall word error rate obtained by the 1996 evaluation system was 27.5%, and a bug-fixed version reduced this to 26.6%.
机译:本文介绍了我们在扩展大型词汇语音识别系统以处理广播新闻转录方面的工作。使用了1995年DARPA H4评估数据集的结果针对不同的前端分析以及使用最大似然线性回归(MLLR)的无监督模型适应性提出。然后介绍了用于1996年H4评估的HTK系统。与以前的HTK大型词汇系统相比,它具有许多新功能,包括解码器引导的分段,分段聚类,基于缓存的语言建模以及MAP和MLLR组合的自适应功能。系统对数据进行多次遍历,并给出每次遍历的详细结果。 1996年评估系统获得的整体单词错误率是27.5%,而错误修复版本将其降低到26.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号