Venue: Workshop on Intelligent Interactive Systems and Language Generation

Generating Descriptions for Sequential Images with Local-Object Attention and Global Semantic Context Modelling



Abstract

In this paper, we propose an end-to-end CNN-LSTM model for generating descriptions for sequential images with a local-object attention mechanism. To generate coherent descriptions, we capture global semantic context using a multilayer perceptron, which learns the dependencies between sequential images. A parallel LSTM network is exploited to decode the sequence descriptions. Experimental results show that our model outperforms the baseline across three different evaluation metrics on the datasets published by Microsoft.
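The two core components the abstract names, attention over local object features and an MLP that summarizes the whole image sequence into a global context, can be illustrated with a minimal NumPy sketch. All shapes, weight matrices, and function names below are illustrative assumptions for exposition, not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def local_object_attention(object_feats, hidden, W):
    # object_feats: (num_objects, feat_dim) CNN features of detected objects
    # hidden:       (hid_dim,) current decoder LSTM state
    # W:            (feat_dim, hid_dim) learned projection (random here)
    scores = object_feats @ W @ hidden        # (num_objects,) relevance scores
    weights = softmax(scores)                 # attention distribution over objects
    return weights @ object_feats             # (feat_dim,) attended local context

def global_semantic_context(image_feats, W1, b1, W2, b2):
    # image_feats: (num_images, feat_dim), one CNN feature per image in the sequence
    # A two-layer MLP over the concatenated sequence features models
    # dependencies between the sequential images.
    x = image_feats.reshape(-1)               # flatten the whole sequence
    h = np.tanh(x @ W1 + b1)
    return h @ W2 + b2                        # (ctx_dim,) global context vector

# Toy dimensions (hypothetical).
rng = np.random.default_rng(0)
num_images, num_objects, feat_dim, hid_dim, ctx_dim = 5, 4, 8, 6, 8

objs = rng.standard_normal((num_objects, feat_dim))
hidden = rng.standard_normal(hid_dim)
attended = local_object_attention(objs, hidden,
                                  rng.standard_normal((feat_dim, hid_dim)))

seq_feats = rng.standard_normal((num_images, feat_dim))
ctx = global_semantic_context(
    seq_feats,
    rng.standard_normal((num_images * feat_dim, 16)), np.zeros(16),
    rng.standard_normal((16, ctx_dim)), np.zeros(ctx_dim),
)
print(attended.shape, ctx.shape)
```

In the paper's setting, a decoder LSTM would presumably consume both the attended local vector (recomputed at each step) and the fixed global context; here the two vectors are simply computed once to show their shapes and data flow.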
