首页> 外文会议>Workshop on discourse in machine translation >Analysing Coreference in Transformer Outputs
【24h】

Analysing Coreference in Transformer Outputs

机译:分析变压器输出中的共指

获取原文

摘要

We analyse coreference phenomena in three neural machine translation systems trained with different data settings with or without access to explicit intra- and cross-sentential anaphoric information. We compare system performance on two different genres: news and TED talks. To do this, we manually annotate (the possibly incorrect) coreference chains in the MT outputs and evaluate the coreference chain translations. We define an error typology that aims to go further than pronoun translation adequacy and includes types such as incorrect word selection or missing words. The features of coreference chains in automatic translations are also compared to those of the source texts and human translations. The analysis shows stronger potential translationese effects in machine translated outputs than in human translations.
机译:我们分析了三种神经机器翻译系统中的共指现象,这些系统经过不同的数据设置训练,可以访问或不访问显式的句内和跨句的照应信息。我们比较两种不同类型的系统性能:新闻和TED演讲。为此,我们在MT输出中手动注释(可能不正确)共参考链,并评估共参考链翻译。我们定义了一种错误类型,其目的是超越代词翻译的充分性,并包括诸如单词选择错误或单词丢失之类的类型。自动翻译中的共指链的特征也与源文本和人工翻译的特征进行了比较。分析表明,机器翻译输出中潜在的翻译效应比人工翻译中的翻译效应更强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号