...
首页> 外文期刊>Machine translation >N-gram posterior probability confidence measures for statistical machine translation: an empirical study
【24h】

N-gram posterior probability confidence measures for statistical machine translation: an empirical study

机译:统计机器翻译的N-gram后验概率置信度度量:一项实证研究

获取原文
获取原文并翻译 | 示例
           

摘要

We report an empirical study of n-gram posterior probability confidence measures for statistical machine translation (SMT). We first describe an efficient and practical algorithm for rapidly computing n-gram posterior probabilities from large translation word lattices. These probabilities are shown to be a good predictor of whether or not the n-gram is found in human reference translations, motivating their use as a confidence measure for SMT. Comprehensive n-gram precision and word coverage measurements are presented for a variety of different language pairs, domains and conditions. We analyze the effect on reference precision of using single or multiple references, and compare the precision of posteriors computed from k-best lists to those computed over the full evidence space of the lattice. We also demonstrate improved confidence by combining multiple lattices in a multi-source translation framework.
机译:我们报告了统计机器翻译(SMT)的n克后验概率置信度测度的经验研究。我们首先描述一种有效而实用的算法,用于从大型翻译词格中快速计算n元语法后验概率。这些概率显示出可以很好地预测是否在人类参考译文中找到了n-gram,从而激发了它们被用作SMT的置信度度量的可能性。针对各种不同的语言对,域和条件,提供了全面的n-gram精度和单词覆盖率测量。我们分析了使用单个或多个引用对引用精度的影响,并将从k最佳列表计算出的后验精度与在整个晶格证据空间上计算出的后代精度进行了比较。通过在多源翻译框架中组合多个晶格,我们还展示了增强的信心。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号