Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions

机译：VQA中的问题相关性：识别非视觉和错误前提问题

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Visual Question Answering (VQA) is the task of answering natural-language questions about images. We introduce the novel problem of determining the relevance of questions to images in VQA. Current VQA models do not reason about whether a question is even related to the given image (e.g., What is the capital of Argentina?) or if it requires information from external resources to answer correctly. This can break the continuity of a dialogue in human-machine interaction. Our approaches for determining relevance are composed of two stages. Given an image and a question, (1) we first determine whether the question is visual or not, (2) if visual, we determine whether the question is relevant to the given image or not. Our approaches, based on LSTM-RNNs, VQA model uncertainty, and caption-question similarity, are able to outperform strong baselines on both relevance tasks. We also present human studies showing that VQA models augmented with such question relevance reasoning are perceived as more intelligent, reasonable, and human-like.

机译：视觉问题应答（VQA）是回答有关图像的自然语言问题的任务。我们介绍了确定对VQA中的图像的相关性的新问题。目前的VQA模型不会有所理由是一个问题甚至与给定图像有关（例如，阿根廷的资本是什么？）或者如果它需要从外部资源的信息正确回答。这可以打破人机交互中对话的连续性。我们确定相关性的方法由两个阶段组成。给定图像和问题，（1）我们首先确定问题是否是视觉上的，（2）如果是视觉，我们确定问题是否与给定的图像相关。我们的方法基于LSTM-RNNS，VQA模型不确定性和标题 - 问题相似性，能够在相关任务中优于强大的基线。我们还提出了人类研究，表明VQA模型增强了这些问题相关推理被认为更聪明，合理和人类。

著录项

来源
《Conference on empirical methods in natural language processing》|2016年|919-924|共6页
会议地点
作者
Arijit Ray; Gordon Christie; Mohit Bansal; Dhruv Batra; Devi Parikh;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool [J] . IEEE Transactions on Pattern Analysis and Machine Intelligence . 2020,第2期

机译：视觉反问题解答：一种新的基准和VQA诊断工具
2. Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering [J] . Goyal Yash, Khot Tejas, Agrawal Aishwarya, International Journal of Computer Vision . 2019,第4期

机译：在VQA问题中制作v：提升图像理解在视觉问题的回答中的作用
3. R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering [J] . Pan Lu, Lei Ji, Wei Zhang, SIGKDD explorations . 2018,第Udisk期

机译：R-VQA：学习具有语义关注的视觉关系事实，用于视觉问题应答
4. Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions [C] . Arijit Ray, Gordon Christie, Mohit Bansal, Conference on empirical methods in natural language processing . 2016

机译：在VQA中的问题相关性：识别非视觉和虚假前提问题
5. Context Based Multi-Image Visual Question Answering (VQA) in Deep Learning [D] . Peddinti, Sudhakar Reddy. 2018

机译：深度学习中基于上下文的多图像视觉问答（VQA）
6. Cutting a Long Story Short? The Clinical Relevance of Asking Parents Nurses and Young Children Themselves to Identify Childrens Mental Health Problems by One or Two Questions [O] . Anne-Mari Borg, Raili Salmelin, Matti Joukamaa, -1

机译：长话短说？要求家长护士和幼儿自己通过一两个问题识别儿童的心理健康问题的临床意义
7. Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions [O] . Ray, Arijit, Christie, Gordon, Bansal, Mohit, 2016

机译：VQa中的问题相关性：识别非视觉和虚假前提问题

Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions

摘要

著录项

相似文献

相关主题

期刊订阅