Topic-Based Image Caption Generation

Sandeep Kumar Dash; Shantanu Acharya; Partha Pakray; Ranjita Das; Alexander Gelbukh

首页> 外文期刊>Arabian Journal for Science and Engineering >Topic-Based Image Caption Generation

【24h】

Topic-Based Image Caption Generation

机译：基于主题的图像标题生成

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Image captioning is to generate captions for a given image based on the content of the image. To describe an image efficiently,it requires extracting as much information from it as possible. Apart from detecting the presence of objects and their relativeorientation, the respective purpose intending the topic of the image is another vital information which can be incorporatedwith the model to improve the efficiency of the caption generation system. The sole aim is to put extra thrust on the contextof the image imitating human approach, as the mere presence of objects which may not be related to the context representingthe image should not be a part of the generated caption. In this work, the focus is on detecting the topic concerning the imageso as to guide a novel deep learning-based encoder–decoder framework to generate captions for the image. The method iscompared with some of the earlier state-of-the-art models based on the result obtained from MSCOCO 2017 training dataset. BLEU, CIDEr, ROGUE-L, METEOR scores are used to measure the efficacy of the model which show improvement inperformance of the caption generation process.

机译：图像字幕是根据图像的内容为给定图像生成字幕。为了有效地描述图像，需要从图像中提取尽可能多的信息。除了检测物体的存在及其相对方位之外，打算作为图像主题的各个目的是另一个重要信息，可以将其与模型结合使用以提高字幕生成系统的效率。唯一的目的是在模仿人的方法的图像的上下文上施加额外的推力，因为可能与表示图像的上下文无关的对象的存在不应该是所生成字幕的一部分。在这项工作中，重点是检测与图像有关的主题，以指导新颖的基于深度学习的编码器-解码器框架为图像生成字幕。基于从MSCOCO 2017训练数据集获得的结果，该方法与一些较早的最新模型进行了比较。 BLEU，CIDEr，ROGUE-L，METEOR得分用于衡量模型的有效性，该模型显示字幕生成过程的性能有所提高。

著录项

来源
《Arabian Journal for Science and Engineering》 |2020年第4期|3025-3034|共10页
作者
Sandeep Kumar Dash; Shantanu Acharya; Partha Pakray; Ranjita Das; Alexander Gelbukh;
展开▼
作者单位

Department of CSE NIT Mizoram Aizawl India;

Department of CSE NIT Silchar Silchar India;

CIC IPN Mexico Mexico;

展开▼
收录信息美国《科学引文索引》(SCI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Image caption generation; Deep learning; Topic modelling;

机译：图像标题生成;深度学习;主题建模;

相似文献

外文文献
中文文献
专利

1. Novel model to integrate word embeddings and syntactic trees for automatic caption generation from images [J] . Soft computing: A fusion of foundations, methodologies and applications . 2020,第2期

机译：从图像中集成Word Embeddings和Syntactic树的小说模型
2. Image caption generation with high-level image features [J] . Ding Songtao, Qu Shiru, Xi Yuling, Pattern recognition letters . 2019,第MAY期

机译：具有高级图像功能的图像标题生成
3. Automatic Caption Generation for News Images [J] . Feng Yansong, Lapata Mirella Pattern Analysis and Machine Intelligence, IEEE Transactions on . 2013,第4期

机译：自动为新闻图像生成字幕
4. MobileNet-based Neural Image Caption Model in Title Generation for Product's Images [C] . Irfan I. Amal, Dwi H. Widyantoro, Ardian Umam International Conference on Advance Informatics: Concepts, Theory and Applications . 2020

机译：基于MobileNet的神经图像标题模型在产品图像的标题生成中
5. Generation of Humorous Caption for Cartoon Images Using Deep Learning [D] . Shanmuga Sundaram, Rajesh. 2018

机译：使用深度学习的卡通形象的幽默标题
6. An Overview of Image Caption Generation Methods [O] . Haoran Wang, Yue Zhang, Xiaosheng Yu 2020

机译：图像字幕生成方法概述
7. Automatic Sentence Generation for Images via Key-phrase Estimation using Large-Scale Captioned Images [O] . 牛久祥孝 2014

机译：通过使用大型字幕图像的关键词短语估计自动生成图像的句子

Topic-Based Image Caption Generation

摘要

著录项

相似文献

相关主题

期刊订阅