Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog

机译：通过反复性的双重关注对视觉对话的多步推理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a new model for visual dialog, Recurrent Dual Attention Network (ReDAN), using multi-step reasoning to answer a series of questions about an image. In each question-answering turn of a dialog, ReDAN infers the answer progressively through multiple reasoning steps. In each step of the reasoning process, the semantic representation of the question is updated based on the image and the previous dialog history, and the recurrently-refined representation is used for further reasoning in the subsequent step. On the VisDial v1.0 dataset, the proposed ReDAN model achieves a new state-of-the-art of 64.47% NDCG score. Visualization on the reasoning process further demonstrates that ReDAN can locate context-relevant visual and textual clues via iterative refinement, which can lead to the correct answer step-by-step.

机译：本文介绍了可视化对话框，经常性双重关注网络（redan）的新模型，使用多步推理来回答有关图像的一系列问题。在对话框的每个问题回答转弯时，redan通过多个推理步骤逐步递交答案。在推理过程的每个步骤中，基于图像和先前的对话历史来更新问题的语义表示，并且复合的表示用于在后续步骤中进一步推理。在Vidial V1.0 DataSet上，拟议的redan模型实现了最新的最先进的64.47％的NDCG得分。在推理过程中的可视化进一步展示了redan可以通过迭代细化来定位上下文相关的视觉和文本线索，这可以通过逐步导致正确的答案。

著录项

来源
《Annual meeting of the Association for Computational Linguistics》|2019年|cxxxiv p. 5926-6603|共12页
会议地点
作者
Zhe Gan; Yu Cheng; Ahmed EI Kholy; Linjie Li; Jingjing Liu; Jianfeng Gao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Recurrent Attention Network with Reinforced Generator for Visual Dialog [J] . Fan Hehe, Zhu Linchao, Yang Yi, ACM transactions on multimedia computing communications and applications . 2020,第3期

机译：用于可视化对话框的经常性注意网络
2. DRAU: Dual Recurrent Attention Units for Visual Question Answering [J] . Osman Ahmed, Samek Wojciech Computer vision and image understanding . 2019,第AUGa期

机译：DRAU：视觉问题回答的双重循环注意力单元
3. DRAU: Dual Recurrent Attention Units for Visual Question Answering [J] . Osman Ahmed, Samek Wojciech Computer vision and image understanding . 2019,第Auga期

机译：DRAU：用于视觉问题的双重复发性注意力单位
4. Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog [C] . Zhe Gan, Yu Cheng, Ahmed EI Kholy, Annual meeting of the Association for Computational Linguistics . 2019

机译：通过可视对话的双向双重关注进行多步推理
5. Dialogic argumentation as a path to enhancing individual argumentive reasoning in academically disadvantaged 8th-graders. [D] . Goh, Wendy Wee Lyn. 2008

机译：对话论证是在学术上处于劣势的八年级学生增强个人论证推理的途径。
6. Modulation of Brain Activity by Selective Attention to Audiovisual Dialogues [O] . Alina Leminen, Maxime Verwoert, Mona Moisala, 2020

机译：通过选择性地关注视听对话来调制脑活动
7. Dual Attention Networks for Visual Reference Resolution in Visual Dialog [O] . Gi-Cheon Kang, Jaeseo Lim, Byoung-Tak Zhang 2019

机译：用于视觉对话框中的视觉参考分辨率的双重关注网络

Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog

摘要

著录项

相似文献

相关主题

期刊订阅