首页> 外文会议>Joint conference on lexical and computational semantics >See No Evil, Say No Evil: Description Generation from Densely Labeled Images
【24h】

See No Evil, Say No Evil: Description Generation from Densely Labeled Images

机译:看到没有邪恶,说没有邪恶:密集标记图像的描述生成

获取原文

摘要

This paper studies generation of descriptive sentences from densely annotated images. Previous work studied generation from automatically detected visual information but produced a limited class of sentences, hindered by currently unreliable recognition of activities and attributes. Instead, we collect human annotations of objects, parts, attributes and activities in images. These annotations allow us to build a significantly more comprehensive model of language generation and allow us to study what visual information is required to generate human-like descriptions. Experiments demonstrate high quality output and that activity annotations and relative spatial location of objects contribute most to producing high quality sentences.
机译:本文研究了从密集注释的图像生成描述性句子的过程。先前的工作研究了自动检测到的视觉信息的生成,但是由于当前对活动和属性的不可靠识别而产生的句子种类有限。相反,我们收集图像中对象,零件,属性和活动的人工注释。这些注释使我们可以构建语言生成的更为全面的模型,并使我们能够研究生成类似于人的描述所需要的视觉信息。实验证明了高质量的输出,并且活动注释和对象的相对空间位置对产生高质量句子的贡献最大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号