UIT-ViIC: A Dataset for the First Evaluation on Vietnamese Image Captioning


Abstract

Image Captioning (IC), the task of automatically generating captions for images, has in recent years attracted attention from researchers in several fields of computer science, including computer vision, natural language processing, and machine learning. This paper contributes to Image Captioning research by extending the available data to a different language: Vietnamese. So far, no Image Captioning dataset has existed for Vietnamese, so building one is the most fundamental first step toward developing Vietnamese Image Captioning. To this end, we first built a dataset of manually written captions for images from the Microsoft COCO dataset that relate to sports played with balls; we call this dataset UIT-ViIC (University of Information Technology - Vietnamese Image Captions). UIT-ViIC consists of 19,250 Vietnamese captions for 3,850 images. We then evaluated our dataset on deep neural network models and compared it with an English dataset and two Vietnamese datasets built by different methods. UIT-ViIC is published on our lab website for research purposes.


Bibliographic details

Source
International Conference on Computational Collective Intelligence | 2020 | pp. 730-742 | 13 pages
Authors
Quan Hoang Lam; Quang Duy Le; Van Kiet Nguyen; Ngan Luu-Thuy Nguyen;
Original format: PDF
Keywords
Image Captioning; Vietnamese; Deep neural network;


