Journal of Visual Communication & Image Representation

Automatic image annotation based on Gaussian mixture model considering cross-modal correlations


Abstract

Automatic image annotation has been an active research topic in computer vision and pattern recognition for decades. In this paper, we present a new method for automatic image annotation based on a Gaussian mixture model (GMM) that takes cross-modal correlations into account. Specifically, we first employ a GMM fitted by the rival penalized expectation-maximization (RPEM) algorithm to estimate the posterior probability of each annotation keyword. Next, a label similarity graph is constructed as a weighted linear combination of label similarity and visual similarity, seamlessly integrating information from both low-level visual features and high-level semantic concepts; this effectively prevents different images that share the same candidate annotations from receiving identical refinement results. A rank-two relaxation heuristic is then applied over the constructed label similarity graph to further mine the correlations among the candidate annotations and obtain the refined annotation results, which play a crucial role in semantics-based image retrieval. The main contributions of this work are summarized as follows: (1) a GMM trained by the RPEM algorithm is exploited to obtain the initial semantic annotations of images; (2) the label similarity graph is constructed as a weighted linear combination of label similarity and the visual similarity of the images associated with the corresponding labels; (3) the candidate annotation set generated by the GMM is refined by solving the max-bisection problem over the weighted label graph with the rank-two relaxation algorithm. Compared with the current competitive model SGMM-RW, we achieve significant improvements of 4% and 5% in precision and 6% and 9% in recall on Corel5k and Mirflickr25k, respectively. (C) 2017 Elsevier Inc. All rights reserved.
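The first two stages of the pipeline described in the abstract can be illustrated with a minimal sketch. The Python snippet below is a hypothetical illustration, not the authors' code: it uses scikit-learn's standard EM-fitted GaussianMixture as a stand-in for the RPEM-fitted GMM, assumes per-keyword training features and precomputed label/visual similarity matrices, and combines the similarities with an assumed weight alpha.

```python
# Hypothetical sketch only (not the authors' implementation). scikit-learn's
# GaussianMixture is fitted with standard EM, standing in for the RPEM-fitted
# GMM described in the abstract; feature extraction, the RPEM updates, and the
# rank-two-relaxation refinement step are not shown.
import numpy as np
from sklearn.mixture import GaussianMixture


def fit_keyword_gmms(features_by_keyword, n_components=3):
    """Fit one GMM per annotation keyword on the visual features of its training images."""
    return {
        kw: GaussianMixture(n_components=n_components, covariance_type="diag").fit(X)
        for kw, X in features_by_keyword.items()
    }


def keyword_posteriors(gmms, x, priors=None):
    """Posterior P(keyword | visual feature x) via Bayes' rule over per-keyword GMM likelihoods."""
    keywords = list(gmms)
    log_lik = np.array([gmms[kw].score_samples(x[None, :])[0] for kw in keywords])
    log_prior = np.log([priors.get(kw, 1.0) for kw in keywords]) if priors else 0.0
    log_post = log_lik + log_prior
    log_post -= np.logaddexp.reduce(log_post)  # normalize in log space
    return dict(zip(keywords, np.exp(log_post)))


def build_label_graph(label_sim, visual_sim, alpha=0.5):
    """Edge weights as a weighted linear combination of label and visual similarity matrices."""
    return alpha * label_sim + (1.0 - alpha) * visual_sim
```

In the paper's setting, the refined annotation set would then be obtained by solving a max-bisection problem over this weighted label graph with the rank-two relaxation heuristic, which is not sketched here.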