首页> 外文会议>Workshop on discourse in machine translation >Analysing concatenation approaches to document-level NMT in two different domains
【24h】

Analysing concatenation approaches to document-level NMT in two different domains

机译:分析两个不同域文档级NMT的串联方法

获取原文

摘要

In this paper, we investigate how different aspects of discourse context affect the performance of recent neural MT systems. We describe two popular datasets covering news and movie subtitles and we provide a thorough analysis of the distribution of various document-level features in their domains. Furthermore, we train a set of context-aware MT models on both datasets and propose a comparative evaluation scheme that contrasts coherent context with artificially scrambled documents and absent context, arguing that the impact of discourse-aware MT models will become visible in this way. Our results show that the models are indeed affected by the manipulation of the test data, providing a different view on document-level translation quality than absolute sentence-level scores.
机译:在本文中,我们研究了语篇背景的不同方面如何影响最近神经MT系统的性能。我们描述了两个流行的数据集,涵盖了新闻和电影字幕,我们对其域中的各种文档级别功能的分布提供了全面的分析。此外,我们在数据集上培训一组背景感知MT模型,并提出了一种比较评估方案,这些方案与人工乱乱的文件和缺陷的上下文对比相干语境,争论话语感知MT模型的影响将以这种方式可见。我们的结果表明,该模型确实受到测试数据的操纵影响,在文档级翻译质量方面提供了与绝对句子级别分数的不同视图。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号