Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

机译：外面阳光灿烂？通过引发问题来改善VQA中的答案一致性

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

While models for Visual Queslion Answering (VQA) have steadily improved over the years, interacting with one quickly reveals that these models lack consistency. For instance, if a model answers "red" to "What color is the balloon?", it might answer "no" if asked, "Is the balloon red?". These responses violate simple notions of entailment and raise questions about how effectively VQA models ground language. In this work, we introduce a dataset. ConVQA, and metrics that enable quantitative evaluation of consistency in VQA. For a given observable fact in an image (e.g. the balloon's color), we generate a set of logically consistent question-answer (QA) pairs (e.g. Is the balloon red?) and also collect a human-annotated set of common-sense based consistent QA pairs (e.g. Is the balloon the same color as tomato sauce?). Further, we propose a consistency-improving data augmentation module, a Consistency Teacher Module (CTM). CTM automatically generates entailed (or similar-intent) questions for a source QA pair and fine-tunes the VQA model if the VQA's answer to the entailed question is consistent with the source QA pair. We demonstrate that our CTM-based training improves the consistency of VQA models on the ConVQA datasets and is a strong baseline for further research.

机译：多年来，虽然视觉Queslion Answering（VQA）模型已得到稳步改进，但与一个模型进行快速交互显示这些模型缺乏一致性。例如，如果模型对“气球是什么颜色？”回答“红色”，则如果被问到“气球是否是红色？”，它可能回答“否”。这些回答违背了简单的含意概念，并引发了有关VQA如何有效地模拟地面语言的问题。在这项工作中，我们介绍了一个数据集。 ConVQA，以及可以定量评估VQA一致性的指标。对于图像中给定的可观察事实（例如，气球的颜色），我们生成了一组逻辑上一致的问答（QA）对（例如，气球为红色吗？），并且还收集了一组基于人类注释的常识保持一致的质量检查对（例如，气球和番茄酱的颜色相同吗？）。此外，我们提出了一个提高一致性的数据增强模块，即一致性教师模块（CTM）。如果VQA对所涉及问题的答案与源QA对一致，则CTM会自动为源QA对生成涉及（或意图相似）的问题，并微调VQA模型。我们证明了我们基于CTM的培训提高了ConVQA数据集上VQA模型的一致性，并且为进一步研究奠定了坚实的基础。

著录项

来源
《International joint conference on natural language processing;Conference on empirical methods in natural language processing》|2019年|5859-5864|共6页
会议地点
作者
Arijit Ray; Karan Sikka; Ajay Divakaran; Stefan Lee; Giedrius Burachas;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool [J] . IEEE Transactions on Pattern Analysis and Machine Intelligence . 2020,第2期

机译：视觉反问题解答：一种新的基准和VQA诊断工具
2. Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering [J] . Goyal Yash, Khot Tejas, Agrawal Aishwarya, International Journal of Computer Vision . 2019,第4期

机译：在VQA问题中制作v：提升图像理解在视觉问题的回答中的作用
3. R-VQA: Learning Visual Relation Facts with Semantic Attention for Visual Question Answering [J] . Pan Lu, Lei Ji, Wei Zhang, SIGKDD explorations . 2018,第Udisk期

机译：R-VQA：学习具有语义关注的视觉关系事实，用于视觉问题应答
4. Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation [C] . Arijit Ray, Karan Sikka, Ajay Divakaran, International joint conference on natural language processing . 2019

机译：外面阳光和黑暗？！通过需要的问题改善VQA的答案一致性
5. Context Based Multi-Image Visual Question Answering (VQA) in Deep Learning [D] . Peddinti, Sudhakar Reddy. 2018

机译：深度学习中基于上下文的多图像视觉问答（VQA）
6. A question-entailment approach to question answering [O] . Asma Ben Abacha, Dina Demner-Fushman 2019

机译：问答式问题解答方法
7. Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation [O] . Arijit Ray, Karan Sikka, Ajay Divakaran, 2019

机译：外面阳光和黑暗？！通过需要的问题改善VQA的答案一致性

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

摘要

著录项

相似文献

相关主题

期刊订阅