首页> 外文会议>Annual meeting of the Association for Computational Linguistics >Bridging Languages through Images with Deep Partial Canonical Correlation Analysis
【24h】

Bridging Languages through Images with Deep Partial Canonical Correlation Analysis

机译:通过具有深部典型典型相关分析的图像的桥接语言

获取原文

摘要

We present a deep neural network that leverages images to improve bilingual text embeddings. Relying on bilingual image tags and descriptions, our approach conditions text embedding induction on the shared visual information for both languages, producing highly correlated bilingual embeddings. In particular, we propose a novel model based on Partial Canonical Correlation Analysis (PCCA). While the original PCCA finds linear projections of two views in order to maximize their canonical correlation conditioned on a shared third variable, we introduce a non-linear Deep PCCA (DPCCA) model, and develop a new stochastic iterative algorithm for its optimization. We evaluate PCCA and DPCCA on multilingual word similarity and cross-lingual image description retrieval. Our models outperform a large variety of previous methods, despite not having access to any visual signal during test time inference.
机译:我们展示了一个深度神经网络,利用图像来改善双语文本嵌入。依靠双语图像标签和描述,我们的方法条件条件文本嵌入了两种语言的共享视觉信息的归纳,产生高度相关的双语嵌入。特别是,我们提出了一种基于部分规范相关分析(PCCA)的新型模型。虽然原始PCCA发现两个视图的线性投影,以便最大化其在共享第三变量上的规范相关性,但我们引入了非线性深型PCCA(DPCCA)模型,并开发了一种新的随机迭代算法,以实现其优化。我们在多语言单词相似性和交叉舌图像描述中评估PCCA和DPCCA。尽管在测试时间推断过程中,我们的模型尽管没有访问任何可视信号,但我们的模型优于各种各样的先前方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号