M-VAD names: a dataset for video captioning with naming

Pini Stefano; Cornia Marcella; Bolelli Federico; Baraldi Lorenzo; Cucchiara Rita

首页> 外文期刊>Multimedia Tools and Applications >M-VAD names: a dataset for video captioning with naming

【24h】

M-VAD names: a dataset for video captioning with naming

机译：M-VAD名称：用于命名视频字幕的数据集

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Current movie captioning architectures are not capable of mentioning characters with their proper name, replacing them with a generic someone tag. The lack of movie description datasets with characters' visual annotations surely plays a relevant role in this shortage. Recently, we proposed to extend the M-VAD dataset by introducing such information. In this paper, we present an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure. The resulting dataset contains 63 k visual tracks and 34 k textual mentions, all associated with character identities. To showcase the features of the dataset and quantify the complexity of the naming task, we investigate multimodal architectures to replace the someone tags with proper character names in existing video captions. The evaluation is further extended by testing this application on videos outside of the M-VAD Names dataset.

机译：当前的电影字幕体系结构无法用适当的名称提及角色，而用通用的someone标签代替它们。缺乏带有角色视觉注释的电影描述数据集肯定会在这种短缺中发挥重要作用。最近，我们建议通过引入此类信息来扩展M-VAD数据集。在本文中，我们提出了数据集的改进版本，即M-VAD名称及其半自动注释过程。结果数据集包含63 k条视觉轨迹和34 k条文字说明，所有这些都与角色标识相关联。为了展示数据集的特征并量化命名任务的复杂性，我们研究了多模式架构，以在现有视频字幕中用适当的字符名称替换someone标签。通过在M-VAD名称数据集之外的视频上测试此应用程序，可以进一步扩展评估。

著录项

来源
《Multimedia Tools and Applications》 |2019年第10期|14007-14027|共21页
作者
Pini Stefano; Cornia Marcella; Bolelli Federico; Baraldi Lorenzo; Cucchiara Rita;
展开▼
作者单位

Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Modena, Italy;

Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Modena, Italy;

Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Modena, Italy;

Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Modena, Italy;

Univ Modena & Reggio Emilia, Dept Engn Enzo Ferrari, Modena, Italy;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Video captioning; Naming; Dataset; Deep learning;

机译：视频字幕;命名;数据集;深度学习;

相似文献

外文文献
中文文献
专利

1. M-VAD names: a dataset for video captioning with naming [J] . Pini Stefano, Cornia Marcella, Bolelli Federico, Multimedia Tools and Applications . 2019,第10期

机译：M-VAD名称：具有命名的视频字幕的数据集
2. Biomedical named entity recognition and linking datasets: survey and our recent development [J] . Ming-Siang Huang, Po-Ting Lai, Pei-Yen Lin, Briefings in bioinformatics . 2020,第6期

机译：生物医学命名实体识别和链接数据集：调查和我们最近的发展
3. Interlinking SciGraph and DBpedia Datasets Using Link Discovery and Named Entity Recognition Techniques [J] . Beyza Yaman, Michele Pasin, Markus Freudenberg OASIcs : OpenAccess Series in Informatics . 2019,第1期

机译：使用链接发现和命名实体识别技术互连SciGraph和DBpedia数据集
4. Towards Video Captioning with Naming: A Novel Dataset and a Multi-modal Approach [C] . Stefano Pini, Marcella Cornia, Lorenzo Baraldi, International conference on image analysis and processing . 2017

机译：使用命名的视频字幕：一种新颖的数据集和一种多模式方法
5. Distributed Dataset Synchronization in Named Data Networking [D] . Shang, Wentao. 2017

机译：命名数据网络中的分布式数据集同步
6. Liberating links between datasets using lightweight data publishing: an example using plant names and the taxonomic literature [O] . Roderic Page 2018

机译：使用轻量级数据发布来解放数据集之间的链接：使用植物名称和分类文献的示例
7. M-VAD names: a dataset for video captioning with naming [O] . Stefano Pini, Marcella Cornia, Federico Bolelli, 2018

机译：M-VAD名称：具有命名的视频字幕的数据集

M-VAD names: a dataset for video captioning with naming

摘要

著录项

相似文献

相关主题

期刊订阅