首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning
【24h】

Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning

机译:用于多语言多峰表示学习的桥梁相关神经网络

获取原文

摘要

Recently there has been a lot of interest in learning common representations for multiple views of data. Typically, such common representations are learned using a parallel corpus between the two views (say, 1M images and their English captions). In this work, we address a real-world scenario where no direct parallel data is available between two views of interest (say, V_1 and V_2) but parallel data is available between each of these views and a pivot view (V_3). We propose a model for learning a common representation for V_1, V_2 and V_3 using only the parallel data available between V_1V_3 and V_2V_3. The proposed model is generic and even works when there are n views of interest and only one pivot view which acts as a bridge between them. There are two specific downstream applications that we focus on (ⅰ) transfer learning between languages L_1,L_2,...,L_n using a pivot language L and (ⅱ) cross modal access between images and a language L_1 using a pivot language L_2. Our model achieves state-of-the-art performance in multilingual document classification on the publicly available multilingual TED corpus and promising results in multilingual multimodal retrieval on a new dataset created and released as a part of this work.
机译:最近,人们对学习多种数据视图的通用表示法非常感兴趣。通常,使用两个视图之间的平行语料库(例如1M图像及其英文字幕)来学习此类常见表示形式。在这项工作中,我们解决了一个现实世界场景,在该场景中,两个感兴趣的视图(例如V_1和V_2)之间没有直接的并行数据可用,但是在这些视图的每个视图与数据透视图(V_3)之间都提供了并行数据。我们提出了一个仅使用V_1V_3和V_2V_3之间可用的并行数据来学习V_1,V_2和V_3的通用表示的模型。提出的模型是通用的,甚至在存在n个感兴趣的视图且只有一个枢轴视图充当它们之间的桥梁时也可以使用。我们有两个特定的下游应用程序,我们专注于(using)使用枢轴语言L在语言L_1,L_2,...,L_n之间转移学习,以及(ⅱ)使用枢轴语言L_2在图像和语言L_1之间进行交叉模式访问。我们的模型在公开可用的多语种TED语料库上实现了多语种文档分类的最新性能,并在作为该工作的一部分而创建和发布的新数据集上,在多语种多模式检索中取得了可喜的成果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号