Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation

机译：词汇链在文件级统计机器翻译中遇到Word Embeddings

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The phrase-based Statistical Machine Translation (SMT) approach deals with sentences in isolation, making it difficult to consider discourse context in translation. This poses a challenge for ambiguous words that need discourse knowledge to be correctly translated. We propose a method that benefits from the semantic similarity in lexical chains to improve SMT output by integrating it in a document-level decoder. We focus on word embeddings to deal with the lexical chains, contrary to the traditional approach that uses lexical resources. Experimental results on German→English show that our method produces correct translations in up to 88% of the changes, improving the translation in 36%-48% of them over the baseline.

机译：基于短语的统计机器翻译（SMT）接近孤立的句子，使得在翻译中难以考虑话语背景。这对需要正确翻译话语知识的模糊词来构成挑战。我们提出了一种从词汇链中的语义相似性中受益的方法，以通过将其集成在文档级解码器中来改善SMT输出。我们专注于嵌入词，以处理词汇链，违背使用词汇资源的传统方法。德语→英语的实验结果表明，我们的方法在最多88％的变化中产生了正确的翻译，在基线上提高了36％-48％的翻译。

著录项

来源
《Workshop on discourse in machine translation》|2017年|xi 121 p.|共11页
会议地点
作者
Laura Mascarell;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Translation of Untranslatable Words - Integration of Lexical Approximation and Phrase-Table Extension Techniques into Statistical Machine Translation [J] . Michael PAUL, Karunesh ARORA, Eiichiro SUMITA IEICE Transactions on Information and Systems . 2009,第12期

机译：不可译词的翻译-将词法近似和短语表扩展技术集成到统计机器翻译中
2. Function words in statistical machine-translated Chinese and original Chinese: A study into the translationese of machine translation systems [J] . Kuo Chen-li Digital scholarship in the humanities . 2019,第4期

机译：统计机器中的功能词 - 翻译的中国和原版中文：一项研究机器翻译系统的研究
3. Graph-Based Bilingual Word Embedding for Statistical Machine Translation [J] . Wang Rui, Zhao Hai, Ploux Sabine, ACM transactions on Asian language information processing . 2018,第4期

机译：统计机器翻译中基于图的双语词嵌入
4. Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation [C] . Laura Mascarell Workshop on discourse in machine translation . 2017

机译：词法链在文档级统计机器翻译中遇到单词嵌入
5. Lexical features for statistical machine translation. [D] . Devlin, Jacob. 2009

机译：统计机器翻译的词汇功能。
6. Lexical embeddings produce interference when they are morphologically unrelated to the words in which they are contained: Evidence from eye movements [O] . Kristin M. Weingartner, Barbara J. Juhasz, Keith Rayner -1

机译：当与含有它们的单词形态无关时词汇嵌入产生干扰：来自眼球运动的证据
7. Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation [O] . Mascarell, Laura 2017

机译：词汇链在文档级统计机器翻译中遇到单词嵌入

Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation

摘要

著录项

相似文献

相关主题

期刊订阅