Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval

Jianshe Zhou; Narentuya; Sheng Tang; Jie Liu

首页> 外文期刊>Multimedia Tools and Applications >Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval

【24h】

Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval

机译：带分段稀疏编码的分层BoW用于大规模图像分类和检索

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The bag-of-words (BoW) has been widely regarded as the most successful algorithms for content-based image related tasks, such as large scale image retrieval, classification, and object categorization. Large visual words acquired by BoW quantization through large vocabulary or codebooks have been receiving much attention in the past years. However, not only construction of large vocabulary but also the quantization process impose a heavy burden in terms of time and memory complexities. In order to tackle this issue, we propose an efficient hierarchical BoW (HBoW) to achieve large visual words through quantization by a compact vocabulary instead of large vocabulary. Our vocabulary is very compact since it is only composed of two small dictionaries which is learned through segmental sparse decomposition of local features. To generate the BoW with large size, we first divide the local features into two half parts, and use the two small dictionaries to compute their sparse codes. Then, we map the two indices of the maximum elements of the two sparse codes to a large set of visual words based upon the fact that data with similar properties will share the same base weighted with the largest sparse coefficient. To further make similar patches have higher probability of select the same dictionary base to get similar BoW vectors, we propose a novel collaborative dictionary learning method by imposing the similarity regularization factor together with the row sparsity regularization across data instances during group sparse coding. Additionally, based on index combination of top-2 large sparse codes of local descriptors, we propose a soft BoW assignment method so that our proposed HBoW can tolerate different word selection for similar patches. By employing the inverted file structure built through our HBoW, K-nearest neighbors (KNN) can be efficiently retrieved. After incorporation of our fast KNN search into the SVM-KNN classification method, our HBoW can be used for efficient image classification and logo recognition. Experiments on serval well-known datasets show that our approach is effective for large scale image classification and retrieval.

机译：词袋（BoW）已被广泛认为是基于内容的图像相关任务（例如大规模图像检索，分类和对象分类）的最成功算法。在过去的几年中，通过BoW量化通过大型词汇或密码本获得的大型视觉单词受到了广泛关注。但是，不仅大量词汇的构建而且量化过程在时间和存储器复杂性方面都带来了沉重的负担。为了解决此问题，我们提出了一种有效的层次化BoW（HBoW），可以通过压缩而不是大词汇量的量化词来实现大视觉单词。我们的词汇表非常紧凑，因为它仅由两个小字典组成，这些字典是通过局部特征的分段稀疏分解来学习的。为了生成较大的BoW，我们首先将局部特征分成两个半部分，然后使用这两个小字典来计算其稀疏代码。然后，基于具有相似属性的数据将共享具有最大稀疏系数的相同基本加权这一事实，我们将两个稀疏代码的最大元素的两个索引映射到一大组视觉词。为了进一步使相似的补丁有更高的概率选择相同的词典库来获得相似的BoW向量，我们通过在组稀疏编码期间对数据实例跨所有实例强加相似度正则化因子和行稀疏度正则化，提出了一种新颖的协作词典学习方法。此外，基于局部描述符的前2个大型稀疏码的索引组合，我们提出了一种软BoW分配方法，以便我们提出的HBoW可以容忍相似补丁的不同单词选择。通过使用通过我们的HBoW构建的反向文件结构，可以有效地检索K最近邻居（KNN）。将我们的快速KNN搜索纳入SVM-KNN分类方法后，我们的HBoW可用于有效的图像分类和徽标识别。对服务著名的数据集进行的实验表明，我们的方法对于大规模图像分类和检索是有效的。

著录项

来源
《Multimedia Tools and Applications》 |2018年第17期|22319-22338|共20页
作者
Jianshe Zhou; Narentuya; Sheng Tang; Jie Liu;
展开▼
作者单位

Beijing Advanced Innovation Center for Imaging Technology, Capital Normal University;

Beijing Advanced Innovation Center for Imaging Technology, Capital Normal University;

Institute of Computing Technology, Chinese Academy of Sciences;

College of Information and Engineering, Capital Normal University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Bag of words; Dictionary learning; Sparse coding; Image retrieval; Image classification;

机译：词袋;字典学习;稀疏编码;图像检索;图像分类;

相似文献

外文文献
中文文献
专利

1. Ontology-driven hierarchical sparse coding for large-scale image classification [J] . Zhang Yan, Qu Yanyun, Li Cuihua, Neurocomputing . 2019,第Sepa30期

机译：本体驱动的分层稀疏编码用于大规模图像分类
2. Ontology-driven hierarchical sparse coding for large-scale image classification [J] . Zhang Yan, Qu Yanyun, Li Cuihua, Neurocomputing . 2019,第SEPa30期

机译：本体驱动的分层稀疏编码用于大规模图像分类
3. Scene classification of remote sensing images based on hierarchical sparse coding [J] . Xu Jiaqing, Lv Qi, Liu Hongjun, The Journal of Engineering . 2018,第16期

机译：基于分层稀疏编码的遥感影像场景分类
4. The Unsupervised Hierarchical Convolutional Sparse Auto-Encoder for Neuroimaging Data Classification [C] . Xiaobing Han, Yanfei Zhong, Lifang He, International conference on brain informatics and health . 2015

机译：用于神经影像数据分类的无监督分层卷积稀疏自动编码器
5. Large-scale image retrieval using similarity preserving binary codes [D] . Gong, Yunchao 2014

机译：使用保留相似性的二进制代码进行大规模图像检索
6. Phaseless Terahertz Coded-Aperture Imaging for Sparse Target Based on Phase Retrieval Algorithm [O] . Long Peng, Chenggao Luo, Bin Deng, 2019

机译：基于相位检索算法的稀疏目标的无相太赫兹编码孔径成像
7. Phaseless Terahertz Coded-Aperture Imaging for Sparse Target Based on Phase Retrieval Algorithm [O] . 2019

机译：基于相位检索算法的稀疏目标释放太赫兹编码 - 光圈成像

Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval

摘要

著录项

相似文献

相关主题

期刊订阅