Learning multi-task local metrics for image annotation

Xu Xing; Shimada Atsushi; Nagahara Hajime; Taniguchi Rin-ichiro

首页> 外文期刊>Multimedia Tools and Applications >Learning multi-task local metrics for image annotation

【24h】

Learning multi-task local metrics for image annotation

机译：学习用于图像标注的多任务本地指标

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The goal of image annotation is to automatically assign a set of textual labels to an image to describe the visual contents thereof. Recently, with the rapid increase in the number of web images, nearest neighbor (NN) based methods have become more attractive and have shown exciting results for image annotation. One of the key challenges of these methods is to define an appropriate similarity measure between images for neighbor selection. Several distance metric learning (DML) algorithms derived from traditional image classification problems have been applied to annotation tasks. However, a fundamental limitation of applying DML to image annotation is that it learns a single global distance metric over the entire image collection and measures the distance between image pairs in the image-level. For multi-label annotation problems, it may be more reasonable to measure similarity of image pairs in the label-level. In this paper, we develop a novel label prediction scheme utilizing multiple label-specific local metrics for label-level similarity measure, and propose two different local metric learning methods in a multi-task learning (MTL) framework. Extensive experimental results on two challenging annotation datasets demonstrate that 1) utilizing multiple local distance metrics to learn label-level distances is superior to using a single global metric in label prediction, and 2) the proposed methods using the MTL framework to learn multiple local metrics simultaneously can model the commonalities of labels, thereby facilitating label prediction results to achieve state-of-the-art annotation performance.

机译：图像注释的目的是为图像自动分配一组文本标签，以描述其视觉内容。近来，随着网络图像数量的迅速增加，基于最近邻居（NN）的方法变得越来越有吸引力，并且显示出令人兴奋的图像标注结果。这些方法的主要挑战之一是在图像之间定义适当的相似度度量以进行邻居选择。源自传统图像分类问题的几种距离度量学习（DML）算法已应用于注释任务。但是，将DML应用于图像注释的基本局限性在于，它在整个图像集合中学习单个全局距离度量，并在图像级别测量图像对之间的距离。对于多标签注释问题，在标签级别测量图像对的相似度可能更合理。在本文中，我们开发了一种新颖的标签预测方案，该方案利用多个特定于标签的局部度量进行标签级相似性度量，并在多任务学习（MTL）框架中提出了两种不同的局部度量学习方法。在两个具有挑战性的注释数据集上的大量实验结果表明，1）在标签预测中利用多个局部距离度量学习标签级距离优于使用单个全局度量，以及2）使用MTL框架提出的方法来学习多个局部度量同时可以对标签的共性进行建模，从而有助于标签预测结果实现最新的标注性能。

著录项

来源
《Multimedia Tools and Applications》 |2016年第4期|2203-2231|共29页
作者
Xu Xing; Shimada Atsushi; Nagahara Hajime; Taniguchi Rin-ichiro;
展开▼
作者单位

Kyushu Univ, Dept Adv Informat & Technol, Fukuoka 812, Japan;

Kyushu Univ, Dept Adv Informat & Technol, Fukuoka 812, Japan;

Kyushu Univ, Dept Adv Informat & Technol, Fukuoka 812, Japan;

Kyushu Univ, Dept Adv Informat & Technol, Fukuoka 812, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Image annotation; Label prediction; Metric learning; Local metric; Multi-task learning;

机译：图像标注;标签预测;度量学习;局部度量;多任务学习;
入库时间 2022-08-17 13:04:19

相似文献

外文文献
中文文献
专利

1. Indoor scene recognition via multi-task metric multi-kernel learning from RGB-D images [J] . Zheng Yu, Gao Xinbo Multimedia Tools and Applications . 2017,第3期

机译：通过从RGB-D图像进行多任务度量多内核学习的室内场景识别
2. Enhanced representation and multi-task learning for image annotation [J] . Alexander Binder, Wojciech Samek, Klaus-Robert Mueller, Computer vision and image understanding . 2013,第5期

机译：图像表示的增强表示和多任务学习
3. Image distance metric learning based on neighborhood sets for automatic image annotation [J] . Jin Cong, Jin Shu-Wei Journal of visual communication & image representation . 2016,第Jana期

机译：基于邻域集的图像距离度量学习用于自动图像标注
4. Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning [C] . Wang Hua, Joshi Dhiraj, Luo Jiebo, 2012 IEEE International Symposium on Multimedia. . 2012

机译：通过相关引导的多任务学习同时进行图像注释和地理标记预测
5. Image annotation and tag completion via kernel metric learning and noisy matrix recovery. [D] . Feng, Zheyun. 2016

机译：通过内核度量学习和噪声矩阵恢复实现图像注释和标签完成。
6. Multi-task learning with a natural metric for quantitative structure activity relationship learning [O] . Noureddin Sadawi, Ivan Olier, Joaquin Vanschoren, 2019

机译：具有自然指标的多任务学习用于定量结构活动关系学习
7. Enhanced representation and multi-task learning for image annotation [O] . Binder A., Samek W., Müller K.R., 2013

机译：增强的图像标注表示和多任务学习

Learning multi-task local metrics for image annotation

摘要

著录项

相似文献

相关主题

期刊订阅