...
首页> 外文期刊>Multimedia Tools and Applications >M-VAD names: a dataset for video captioning with naming
【24h】

M-VAD names: a dataset for video captioning with naming

机译:M-VAD名称:用于命名视频字幕的数据集

获取原文
获取原文并翻译 | 示例
           

摘要

Current movie captioning architectures are not capable of mentioning characters with their proper name, replacing them with a generic someone tag. The lack of movie description datasets with characters' visual annotations surely plays a relevant role in this shortage. Recently, we proposed to extend the M-VAD dataset by introducing such information. In this paper, we present an improved version of the dataset, namely M-VAD Names, and its semi-automatic annotation procedure. The resulting dataset contains 63 k visual tracks and 34 k textual mentions, all associated with character identities. To showcase the features of the dataset and quantify the complexity of the naming task, we investigate multimodal architectures to replace the someone tags with proper character names in existing video captions. The evaluation is further extended by testing this application on videos outside of the M-VAD Names dataset.
机译:当前的电影字幕体系结构无法用适当的名称提及角色,而用通用的someone标签代替它们。缺乏带有角色视觉注释的电影描述数据集肯定会在这种短缺中发挥重要作用。最近,我们建议通过引入此类信息来扩展M-VAD数据集。在本文中,我们提出了数据集的改进版本,即M-VAD名称及其半自动注释过程。结果数据集包含63 k条视觉轨迹和34 k条文字说明,所有这些都与角色标识相关联。为了展示数据集的特征并量化命名任务的复杂性,我们研究了多模式架构,以在现有视频字幕中用适当的字符名称替换someone标签。通过在M-VAD名称数据集之外的视频上测试此应用程序,可以进一步扩展评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号