Computer Speech and Language

Transfer learning for multimodal dialog


Abstract

Audio-Visual Scene-Aware Dialog (AVSD) is best understood as an extension of Visual Question Answering, the task of generating a textual answer in response to a textual question about multimedia content. In AVSD, the answer-relevant "context" is expanded to include past dialog turns, which we view as a specialized form of extra textual knowledge (in addition to the standard video features). We have developed a framework that uses hierarchical attention to fuse contributions from different modalities, and have shown how it can be used to generate textual summaries from multimodal sources, specifically videos with accompanying commentary. In this paper, we transfer the algorithmic approach, models, and data from this background corpus of 2000 hours of how-to videos to the AVSD task, and report our findings. Our approach uses dialog context but makes no assumption about the ordering of the history. Our system achieves the best performance in both automatic and human evaluations in the 7th Dialog State Tracking Challenge (AVSD track).
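The abstract describes fusing contributions from different modalities (video features plus dialog-history text) with hierarchical attention. The sketch below is a minimal, hypothetical illustration of that general two-level idea only: attend within each modality, then attend across the per-modality summaries. It is not the authors' actual architecture (which, per the abstract, also conditions on the question and dialog context); all class, variable, and dimension names are assumptions made for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HierarchicalAttentionFusion(nn.Module):
    """Illustrative two-level attention: attend over features within each
    modality, then attend over the resulting per-modality summaries."""
    def __init__(self, dim):
        super().__init__()
        self.within = nn.Linear(dim, 1)   # scores features inside one modality
        self.across = nn.Linear(dim, 1)   # scores the modality summaries

    def forward(self, modalities):
        # modalities: list of tensors, each of shape (seq_len_i, dim)
        summaries = []
        for feats in modalities:
            w = F.softmax(self.within(feats), dim=0)     # (seq_len_i, 1)
            summaries.append((w * feats).sum(dim=0))     # (dim,)
        stacked = torch.stack(summaries)                 # (num_modalities, dim)
        a = F.softmax(self.across(stacked), dim=0)       # (num_modalities, 1)
        return (a * stacked).sum(dim=0)                  # fused vector, (dim,)

# Hypothetical usage: fuse video, audio, and dialog-history features of width 256.
fusion = HierarchicalAttentionFusion(dim=256)
video  = torch.randn(40, 256)   # e.g. per-frame visual features
audio  = torch.randn(30, 256)   # e.g. audio segment features
dialog = torch.randn(12, 256)   # e.g. embedded past dialog turns
fused = fusion([video, audio, dialog])
print(fused.shape)              # torch.Size([256])
```

Because the second attention level weights whole modalities rather than individual features, this kind of fusion lets the model down-weight an uninformative modality (for instance, silent video) for a given question, which is the intuition behind hierarchical attention fusion.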
