首页> 外文会议>International joint conference on natural language processing;Conference on empirical methods in natural language processing >Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
【24h】

Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation

机译:外面阳光灿烂?通过引发问题来改善VQA中的答案一致性

获取原文

摘要

While models for Visual Queslion Answering (VQA) have steadily improved over the years, interacting with one quickly reveals that these models lack consistency. For instance, if a model answers "red" to "What color is the balloon?", it might answer "no" if asked, "Is the balloon red?". These responses violate simple notions of entailment and raise questions about how effectively VQA models ground language. In this work, we introduce a dataset. ConVQA, and metrics that enable quantitative evaluation of consistency in VQA. For a given observable fact in an image (e.g. the balloon's color), we generate a set of logically consistent question-answer (QA) pairs (e.g. Is the balloon red?) and also collect a human-annotated set of common-sense based consistent QA pairs (e.g. Is the balloon the same color as tomato sauce?). Further, we propose a consistency-improving data augmentation module, a Consistency Teacher Module (CTM). CTM automatically generates entailed (or similar-intent) questions for a source QA pair and fine-tunes the VQA model if the VQA's answer to the entailed question is consistent with the source QA pair. We demonstrate that our CTM-based training improves the consistency of VQA models on the ConVQA datasets and is a strong baseline for further research.
机译:多年来,虽然视觉Queslion Answering(VQA)模型已得到稳步改进,但与一个模型进行快速交互显示这些模型缺乏一致性。例如,如果模型对“气球是什么颜色?”回答“红色”,则如果被问到“气球是否是红色?”,它可能回答“否”。这些回答违背了简单的含意概念,并引发了有关VQA如何有效地模拟地面语言的问题。在这项工作中,我们介绍了一个数据集。 ConVQA,以及可以定量评估VQA一致性的指标。对于图像中给定的可观察事实(例如,气球的颜色),我们生成了一组逻辑上一致的问答(QA)对(例如,气球为红色吗?),并且还收集了一组基于人类注释的常识保持一致的质量检查对(例如,气球和番茄酱的颜色相同吗?)。此外,我们提出了一个提高一致性的数据增强模块,即一致性教师模块(CTM)。如果VQA对所涉及问题的答案与源QA对一致,则CTM会自动为源QA对生成涉及(或意图相似)的问题,并微调VQA模型。我们证明了我们基于CTM的培训提高了ConVQA数据集上VQA模型的一致性,并且为进一步研究奠定了坚实的基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号