【24h】

Lattice Transformer for Speech Translation

机译:言语翻译的格子变压器

获取原文

摘要

Recent advances in sequence modeling have highlighted the strengths of the transformer architecture, especially in achieving state-of-the-art machine translation results. However, depending on the up-stream systems, e.g., speech recognition, or word segmentation, the input to translation system can vary greatly. The goal of this work is to extend the attention mechanism of the transformer to naturally consume the lattice in addition to the traditional sequential input. We first propose a general lattice transformer for speech translation where the input is the output of the automatic speech recognition (ASR) which contains multiple paths and posterior scores. To leverage the extra information from the lattice structure, we develop a novel controllable lattice attention mechanism to obtain latent representations. On the LDC Spanish-English speech translation corpus, our experiments show that lattice transformer generalizes significantly better and outperforms both a transformer baseline and a lattice LSTM. Additionally, we validate our approach on the WMT 2017 Chinese-English translation task with lattice inputs from different BPE segmentations. In this task, we also observe the improvements over strong baselines.
机译:序列建模的最新进展突出了变压器架构的优势,尤其是在实现最先进的机器翻译结果方面。然而,取决于上游系统,例如语音识别或单词分割,翻译系统的输入可以大大变化。这项工作的目标是扩大变压器的注意机制,以便除了传统的顺序输入之外,还可以避免晶格。我们首先提出一个用于语音翻译的一般格子变压器,其中输入是包含多个路径和后谱的自动语音识别(ASR)的输出。要利用晶格结构的额外信息,我们开发了一种新颖的可控格子注意机制,以获得潜在的表现。在LDC西班牙语语音翻译语料库中,我们的实验表明,格子变压器概括了变压器基线和晶格LSTM的显着更好,优于更好。此外,我们验证了我们在WMT 2017中文翻译任务上的方法,并使用不同BPE分割的晶格输入。在这项任务中,我们还观察到强大的基线的改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号