首页> 外文会议>Workshop on discourse in machine translation >Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation
【24h】

Lexical Chains meet Word Embeddings in Document-level Statistical Machine Translation

机译:词汇链在文件级统计机器翻译中遇到Word Embeddings

获取原文

摘要

The phrase-based Statistical Machine Translation (SMT) approach deals with sentences in isolation, making it difficult to consider discourse context in translation. This poses a challenge for ambiguous words that need discourse knowledge to be correctly translated. We propose a method that benefits from the semantic similarity in lexical chains to improve SMT output by integrating it in a document-level decoder. We focus on word embeddings to deal with the lexical chains, contrary to the traditional approach that uses lexical resources. Experimental results on German→English show that our method produces correct translations in up to 88% of the changes, improving the translation in 36%-48% of them over the baseline.
机译:基于短语的统计机器翻译(SMT)接近孤立的句子,使得在翻译中难以考虑话语背景。这对需要正确翻译话语知识的模糊词来构成挑战。我们提出了一种从词汇链中的语义相似性中受益的方法,以通过将其集成在文档级解码器中来改善SMT输出。我们专注于嵌入词,以处理词汇链,违背使用词汇资源的传统方法。德语→英语的实验结果表明,我们的方法在最多88%的变化中产生了正确的翻译,在基线上提高了36%-48%的翻译。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号