Image automatic annotation via multi-view deep representation

Yang Yang; Zhang Wensheng; Xie Yuan

Abstract

The performance of text-based image retrieval depends heavily on tedious and inefficient manual annotation. To generate image keywords automatically, extensive work has been done in the area of image annotation. However, how to handle the diverse keywords of an image and how to choose appropriate features remain two difficult problems. To address these challenges, we propose the multi-view stacked auto-encoder (MVSAE) framework to establish correlations between low-level visual features and high-level semantic information. A new method that incorporates keyword frequencies and log-entropy is presented to address the imbalanced distribution of keywords. To exploit the complementarity among diverse visual descriptors, we apply multi-view learning to search for label-specific features. The image keywords are then produced from the appropriate features. Through extensive experiments on three popular data sets, we demonstrate that the proposed framework achieves effective and favorable performance for image annotation. (C) 2015 Elsevier Inc. All rights reserved.
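
The abstract mentions combining keyword frequencies with log-entropy to counter the imbalanced keyword distribution, but this record does not give the formula. As a rough illustration only, the sketch below uses the standard log-entropy global weight from text retrieval, applied to binary image tags; it is an assumption and may differ from what the paper actually does. The function name `log_entropy_weights` and the toy `annotations` data are hypothetical.

```python
import math
from collections import Counter


def log_entropy_weights(annotations):
    """Return one global weight per keyword; rarer keywords get larger weights.

    Assumption: the standard log-entropy global weight
        g = 1 + sum_j(p_j * log p_j) / log(n)
    with binary tags, so p_j = 1 / gf for each image carrying the keyword
    (gf = number of images tagged with it, n = number of images).
    """
    n_images = len(annotations)
    if n_images < 2:
        raise ValueError("need at least two images to normalize by log(n)")

    # Global frequency: in how many images each keyword appears.
    freq = Counter(kw for tags in annotations for kw in set(tags))

    weights = {}
    for kw, gf in freq.items():
        # With binary tags, sum_j p_j * log p_j collapses to -log(gf).
        entropy_term = -math.log(gf)
        weights[kw] = 1.0 + entropy_term / math.log(n_images)
    return weights


# Hypothetical toy data: "sky" is frequent, "tree" and "tiger" are rare.
annotations = [
    ["sky", "sea"],
    ["sky", "tree"],
    ["sky", "sea"],
    ["sky", "tiger"],
]
weights = log_entropy_weights(annotations)
```

Under this weighting, a keyword attached to every image gets weight 0 and a keyword attached to a single image gets weight 1, so rare labels contribute more when the weights are used to rescale a training loss or a label vector.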

Bibliographic record

Source
Journal of visual communication & image representation | 2015, No. 11 | pp. 368-377 | 10 pages
Authors
Yang Yang; Zhang Wensheng; Xie Yuan
Affiliations

Univ Chinese Acad Sci, Inst Automat, Shanghai, Peoples R China;

Univ Chinese Acad Sci, Inst Automat, Shanghai, Peoples R China;

Univ Chinese Acad Sci, Inst Automat, Shanghai, Peoples R China;

Original format PDF
Language English
Keywords
Image annotation; Stacked auto-encoder; Imbalance learning; Multi-view learning; Image features; Semantic gap; Deep learning; Multi-labeling


Similar literature

1. Learning Regularized Multi-View Structured Sparse Representation for Image Annotation [J]. Zhiqiang Xing, Miao Zang, Yongmei Zhang. International Journal of Innovative Computing Information and Control, 2018, No. 4
2. Joint multi-view representation and image annotation via optimal predictive subspace learning [J]. Zhe Xue, Guorong Li, Qingming Huang. Information Sciences: An International Journal, 2018
3. Automatic target recognition with joint sparse representation of heterogeneous multi-view SAR images over a locally adaptive dictionary [J]. Zongjie Cao, Liyuan Xu, Jilan Feng. Signal processing, 2016, No. sepa
4. Hybrid image representation methods for automatic image annotation: A survey [C]. Bouyerbou Hafidha, Oukid Saliha, Benblidia Nadjia. 2012 International Conference on Signals and Electronic Systems, 2012
5. Reconstructing and Optimizing Natural Images Perceived by the Human Brain Based on Bayesian Deep Multi-View Learning [D]. Li, Xintong. 2021
6. Recognition of EEG Signal Motor Imagery Intention Based on Deep Multi-View Feature Learning [O]. Jiacan Xu, Hao Zheng, Jianhui Wang. 2020
7. Automatic Image Annotation using Deep Learning Representations [O]. Venkatesh N. Murthy, Subhransu Maji, R. Manmatha. 2015
