
Unsupervised Visual Sense Disambiguation for Verbs using Multimodal Embeddings



Abstract

We introduce a new task, visual sense disambiguation for verbs: given an image and a verb, assign the correct sense of the verb, i.e., the one that describes the action depicted in the image. Just as textual word sense disambiguation is useful for a wide range of NLP tasks, visual sense disambiguation can be useful for multimodal tasks such as image retrieval, image description, and text illustration. We introduce VerSe, a new dataset that augments existing multimodal datasets (COCO and TUHOI) with sense labels. We propose an unsupervised algorithm based on Lesk which performs visual sense disambiguation using textual, visual, or multimodal embeddings. We find that textual embeddings perform well when gold-standard textual annotations (object labels and image descriptions) are available, while multimodal embeddings perform well on unannotated images. We also verify our findings by using the textual and multimodal embeddings as features in a supervised setting and analyse the performance of the visual sense disambiguation task.
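The core idea of the embedding-based Lesk variant described in the abstract can be sketched as follows: embed the context derived from the image (e.g., its object labels or description, or the image itself), embed each candidate sense's dictionary gloss, and pick the sense whose embedding is most similar to the context embedding. The sketch below is a minimal illustration of that selection step, not the paper's exact method; the sense ids and 3-dimensional vectors are hypothetical placeholders for real (e.g., WordNet-derived) sense inventories and learned embeddings.

```python
import numpy as np

def cosine(u, v):
    # Cosine similarity between two dense embedding vectors.
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def disambiguate(context_vec, sense_vecs):
    """Embedding-based Lesk: return the sense id whose gloss/sense
    embedding is most similar to the context embedding."""
    return max(sense_vecs, key=lambda s: cosine(context_vec, sense_vecs[s]))

# Toy example: two hypothetical senses of the verb "play".
senses = {
    "play.v.01": np.array([1.0, 0.1, 0.0]),  # engage in a game or sport
    "play.v.03": np.array([0.0, 0.2, 1.0]),  # perform on an instrument
}
context = np.array([0.9, 0.2, 0.1])  # embedding of the image's context
print(disambiguate(context, senses))  # -> play.v.01
```

In the unsupervised setting this argmax over sense similarities is the whole decision rule; in the supervised setting described in the abstract, the same embeddings would instead be fed as features to a trained classifier.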

