Analysing concatenation approaches to document-level NMT in two different domains

机译：分析两个不同域文档级NMT的串联方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we investigate how different aspects of discourse context affect the performance of recent neural MT systems. We describe two popular datasets covering news and movie subtitles and we provide a thorough analysis of the distribution of various document-level features in their domains. Furthermore, we train a set of context-aware MT models on both datasets and propose a comparative evaluation scheme that contrasts coherent context with artificially scrambled documents and absent context, arguing that the impact of discourse-aware MT models will become visible in this way. Our results show that the models are indeed affected by the manipulation of the test data, providing a different view on document-level translation quality than absolute sentence-level scores.

机译：在本文中，我们研究了语篇背景的不同方面如何影响最近神经MT系统的性能。我们描述了两个流行的数据集，涵盖了新闻和电影字幕，我们对其域中的各种文档级别功能的分布提供了全面的分析。此外，我们在数据集上培训一组背景感知MT模型，并提出了一种比较评估方案，这些方案与人工乱乱的文件和缺陷的上下文对比相干语境，争论话语感知MT模型的影响将以这种方式可见。我们的结果表明，该模型确实受到测试数据的操纵影响，在文档级翻译质量方面提供了与绝对句子级别分数的不同视图。

著录项

来源
《Workshop on discourse in machine translation》|2019年|63 p.|共11页
会议地点
作者
Yves Scherrer; Joerg Tiedemann; Sharid Loaiciga;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Domain-specific Named Entity Recognition with Document-Level Optimization [J] . Wang Limin, Li Shoushan, Yan Qian, ACM transactions on Asian language information processing . 2018,第4期

机译：具有文档级优化的特定于域的命名实体识别
2. The crystal structure of pyrimidine/thiamin biosynthesis precursor-like domain-containing protein CAE31940 from proteobacterium Bordetella bronchiseptica RB50, and evolutionary insight into the NMT1/THI5 family [J] . Jacek Bajor, Karolina L. Tkaczuk, Maksymilian Chruszcz, Journal of structural and functional genomics . 2014,第2期

机译：支气管杆菌博德特氏菌RB50的嘧啶/硫胺素类生物合成前体样结构域蛋白CAE31940的晶体结构，以及对NMT1 / THI5家族的进化见解
3. SPECIES DELIMITATION IN THE LICHENIZED FUNGAL GENUS VULPICIDA (PARMELIACEAE, ASCOMYCOTA) USING GENE CONCATENATION AND COALESCENT-BASED SPECIES TREE APPROACHES [J] . Saag Lauri, Mark Kristiina, Saag Andres, American journal of botany . 2014,第12期

机译：利用基因融合和基于凝结性的树种方法对田CHE化真菌（Vulpicida）（亚科，甲虫）进行物种界定
4. Analysing concatenation approaches to document-level NMT in two different domains [C] . Yves Scherrer, Joerg Tiedemann, Sharid Loaiciga Workshop on discourse in machine translation . 2019

机译：分析两个不同域中的文档级NMT的级联方法
5. Structural analyses of human and zebrafish P0 cytoplasmic domains and molecular characterization of P0 from Xenopus laevis [D] . Luo, Xiaoyang 2007

机译：人和斑马鱼P0胞质域的结构分析和非洲爪蟾P0的分子表征
6. Discovering cryptic species in the Aspiciliella intermutans complex (Megasporaceae, Ascomycota) – First results using gene concatenation and coalescent-based species tree approaches [O] . Zakieh Zakeri, Volker Otte, Harrie Sipman, 2015

机译：在互生Aspiciliella intermutans复合体（Megasporaceae，Ascomycota）中发现隐性物种–使用基因级联和基于聚结的物种树方法的第一个结果
7. Analysing concatenation approaches to document-level NMT in two different domains [O] . Yves Scherrer, Jörg Tiedemann, Sharid Loáiciga 2019

机译：分析两个不同域文档级NMT的串联方法

Analysing concatenation approaches to document-level NMT in two different domains

摘要

著录项

相似文献

相关主题

期刊订阅