Treatment of Markup in Statistical Machine Translation


Abstract

We present work on handling XML markup in Statistical Machine Translation (SMT). The methods we propose can be used to effectively preserve markup (for instance inline formatting or structure) and to place markup correctly in a machine-translated segment. We evaluate our approaches with parallel data that naturally contains markup or where markup was inserted to create synthetic examples. In our experiments, hybrid reinsertion has proven the most accurate method to handle markup, while alignment masking and alignment reinsertion should be regarded as viable alternatives. We provide implementations of all the methods described and they are freely available as an open-source framework.

Bibliographic record

Source: Workshop on discourse in machine translation, 2017, pp. 36-46 (11 pages)
Author: Mathias Mueller
Original format: PDF

