Enhanced representation and multi-task learning for image annotation

Alexander Binder; Wojciech Samek; Klaus-Robert Mueller; Motoaki Kawanabe

首页> 外文期刊>Computer vision and image understanding >Enhanced representation and multi-task learning for image annotation

【24h】

Enhanced representation and multi-task learning for image annotation

机译：图像表示的增强表示和多任务学习

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we propose a novel biased random sampling strategy for image representation in Bag-of-Words models. We evaluate its impact on the feature properties and the ranking quality for a set of semantic concepts and show that it improves performance of classifiers in image annotation tasks and increases the correlation between kernels and labels. As second contribution we propose a method called Output Kernel Multi-Task Learning (MTL) to improve ranking performance by transfer information between classes. The main advantages of output kernel MTL are that it permits asymmetric information transfer between tasks and scales to training sets of several thousand images. We give a theoretical interpretation of the method and show that the learned contributions of source tasks to target tasks are semantically consistent. Both strategies are evaluated on the ImageCLEF PhotoAnnotation dataset. Our best visual result which used the MTL method was ranked first according to mean Average Precision (mAP) within the purely visual submissions in the ImageCLEF 2011 PhotoAnnotation Challenge. Our multi-modal submission achieved the first rank by mAP among all submissions in the same competition.

机译：在本文中，我们提出了一种新颖的有偏随机抽样策略，用于单词袋模型中的图像表示。我们评估了它对一组语义概念的特征属性和排名质量的影响，并表明它提高了图像标注任务中分类器的性能，并增加了内核和标签之间的相关性。作为第二贡献，我们提出了一种称为输出内核多任务学习（MTL）的方法，以通过在类之间传递信息来提高排名性能。输出内核MTL的主要优点在于，它允许任务之间的不对称信息传输，并缩放到数千个图像的训练集。我们对该方法进行了理论解释，并表明源任务对目标任务的学习贡献在语义上是一致的。两种策略都在ImageCLEF PhotoAnnotation数据集上进行评估。在ImageCLEF 2011 PhotoAnnotation挑战的纯视觉提交中，使用MTL方法获得的最佳视觉效果在平均视觉精度（mAP）中排名第一。我们的多模式提交方式在同一比赛的所有提交方式中均获得了mAP的第一名。

著录项

来源
《Computer vision and image understanding》 |2013年第5期|466-478|共13页
作者
Alexander Binder; Wojciech Samek; Klaus-Robert Mueller; Motoaki Kawanabe;
展开▼
作者单位

Fraunhofer Institute FIRST, Kekulestr. 7, 12489 Berlin, Germany Machine Learning Croup, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany;

Fraunhofer Institute FIRST, Kekulestr. 7, 12489 Berlin, Germany Machine Learning Croup, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany;

Machine Learning Croup, Berlin Institute of Technology (TU Berlin), Marchstrasse 23, 10587 Berlin, Germany Bernstein Focus: Neurotechnology Berlin, 10587 Berlin, Germany Department of Brain and Cognitive Engineering, Korea University, Anam-dong, Seongbuk-gu, Seoul 136-713, Republic of Korea;

ATR Brain Information Communication Research Laboratory Croup, 2-2-2 Hikaridai, Seika-cho, Soraku-gun,Kyoto 619-0288, Japan Fraunhofer Institute FIRST, Kekulestr. 7, 12489 Berlin, Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Image ranking; Image classification; Multiple kernel learning; Multi task learning; Bag-of-Words representation; Biased random sampling; ImageCLEF; Mutual information;

机译：图片排名;图像分类;多核学习;多任务学习;词袋表示法;偏向随机抽样;ImageCLEF;相互信息;
入库时间 2022-08-17 13:21:07

相似文献

外文文献
中文文献
专利

1. Learning multi-task local metrics for image annotation [J] . Xu Xing, Shimada Atsushi, Nagahara Hajime, Multimedia Tools and Applications . 2016,第4期

机译：学习用于图像标注的多任务本地指标
2. LEARNING REGULARIZED MULTI-VIEW STRUCTURED SPARSE REPRESENTATION FOR IMAGE ANNOTATION [J] . ZHIQIANG XING, MIAO ZANG, YONGMEI ZHANG International Journal of Innovative Computing Information and Control . 2018,第4期

机译：用于图像标注的学习调节多视图结构化稀疏表示
3. Deep learning based feature representation for automated skin histopathological image annotation [J] . Zhang Gang, Hsu Ching-Hsien Robert, Lai Huadong, Multimedia Tools and Applications . 2018,第8期

机译：基于深度学习的特征表示可自动进行皮肤组织病理学图像注释
4. Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning [C] . Wang Hua, Joshi Dhiraj, Luo Jiebo, 2012 IEEE International Symposium on Multimedia. . 2012

机译：通过相关引导的多任务学习同时进行图像注释和地理标记预测
5. Representations and Representation Learning for Image Aesthetics Prediction and Image Enhancement [D] . Kucer, Michal. 2020

机译：图像美学预测和图像增强的表示和代表学习
6. Two-Stage Multi-Task Representation Learning for Synthetic Aperture Radar (SAR) Target Images Classification [O] . Xinzheng Zhang, Yijian Wang, Zhiying Tan, 2017

机译：合成孔径雷达（SAR）目标图像分类的两阶段多任务表示学习
7. Enhanced representation and multi-task learning for image annotation [O] . Binder A., Samek W., Müller K.R., 2013

机译：增强的图像标注表示和多任务学习

Enhanced representation and multi-task learning for image annotation

摘要

著录项

相似文献

相关主题

期刊订阅