IEEE Journal on Emerging and Selected Topics in Circuits and Systems

Embedding Multi-Order Spatial Clues for Scalable Visual Matching and Retrieval



Abstract

Matching duplicate visual content among images serves as the basis of many vision tasks. Researchers have proposed different local descriptors for image matching, e.g., floating-point descriptors such as SIFT and SURF, and binary descriptors such as ORB and BRIEF. These descriptors suffer either from relatively expensive computation or from limited robustness due to their compact binary representation. This paper studies how to improve the matching efficiency and accuracy of floating-point descriptors and the matching accuracy of binary descriptors. To achieve this goal, we embed the spatial clues among local descriptors into a novel local feature, the multi-order visual phrase, which contains two complementary clues: 1) the center visual clue extracted at each image keypoint and 2) the visual and spatial clues of multiple nearby keypoints. Different from existing visual phrase features, two multi-order visual phrases are matched flexibly: their center visual clues are matched first, and a match confidence is then estimated by checking the spatial and visual consistency of their neighbor keypoints. Therefore, the multi-order visual phrase does not sacrifice the repeatability of the classic visual word and is more robust to quantization error than existing visual phrase features. We extract multi-order visual phrases from both SIFT and ORB and test them in image matching and retrieval tasks on UKbench, Oxford5K, and 1 million distractor images collected from Flickr. Comparisons with recent retrieval approaches clearly demonstrate the competitive accuracy and significantly better efficiency of our approaches.
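
The two-stage matching described above can be summarized as: compare center visual words first, then estimate a confidence from neighbor consistency. The Python sketch below is only a minimal illustration of that idea, assuming keypoints have already been quantized to visual words and that each phrase stores its neighbors as (visual word, relative orientation) pairs; the data layout, the angle tolerance, and the confidence formula are illustrative assumptions, not the paper's exact formulation.

from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class MultiOrderPhrase:
    center_word: int                    # quantized visual word at the keypoint
    neighbors: List[Tuple[int, float]]  # (visual word, relative orientation) of nearby keypoints

def match_confidence(p: MultiOrderPhrase, q: MultiOrderPhrase,
                     angle_tol: float = 0.35) -> float:
    """Return a confidence in [0, 1] that p and q depict the same local patch."""
    # Step 1: cheap center test -- the center visual words must agree, exactly as
    # in classic bag-of-words matching, so repeatability is not sacrificed.
    if p.center_word != q.center_word:
        return 0.0
    # Step 2: count neighbor keypoints that are consistent in BOTH the visual
    # clue (same word) and the spatial clue (similar relative orientation).
    consistent = 0
    for word_p, angle_p in p.neighbors:
        for word_q, angle_q in q.neighbors:
            if word_p == word_q and abs(angle_p - angle_q) < angle_tol:
                consistent += 1
                break
    # Confidence grows with the fraction of verified neighbors (hypothetical scoring).
    return consistent / max(len(p.neighbors), 1)

In this sketch, a confidence of 0 means the center words already disagree, while values near 1 mean most neighbor keypoints are both visually and spatially consistent; a retrieval system could threshold or weight matches by this score.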
