Kernelized pyramid nearest-neighbor search for object categorization

Hong Cheng; Rongchao Yu; Zicheng Liu; Lu Yang; Xue-wen Chen

Abstract

Nearest-neighbor-based image classification has drawn considerable attention in the past several years thanks to its simplicity and efficiency. Recently, a kernelized version of the Naive-Bayes Nearest-Neighbor approach (KNBNN) has been proposed to combine nearest-neighbor-based approaches with other bag-of-features (BoF) based kernels. However, like an orderless BoF image representation, KNBNN ignores global geometric correspondence. In this paper, our contributions are threefold. First, we present a technique to exploit global geometric correspondence in a kernelized NBNN classifier framework: we divide an image into increasingly fine sub-regions, as in the spatial pyramid matching (SPM) approach. Second, we introduce a pyramid nearest-neighbor kernel by measuring the local similarity in each pyramid window. Third, to better calibrate the outputs of each window, we fit a sigmoid function that maps its SVM outputs to posterior probabilities, and then weight the calibrated outputs of all windows. The sigmoid parameters and weight values are learned in a class-dependent and window-dependent manner. By doing so, we learn a class-specific geometric correspondence. Finally, the proposed approach is evaluated on two public datasets: Scene-15 and Caltech-101. We reach an 85.2% recognition rate on Scene-15 and 73.3% on Caltech-101 using only a single descriptor. The experimental results show that our approach significantly outperforms existing techniques.
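
The three steps named in the abstract (pyramid windows, per-window nearest-neighbor similarity, sigmoid calibration with weighted combination) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the 2^l x 2^l grid layout, the use of mean squared nearest-neighbor distance as the local similarity, and the treatment of empty windows are all assumptions made for illustration. In particular, the per-window scores below stand in for the per-window SVM outputs that the paper calibrates, and the sigmoid parameters and weights are passed in by hand rather than learned.

```python
import numpy as np

def pyramid_windows(levels):
    """Yield (level, row, col) for each cell of an assumed 2^l x 2^l pyramid."""
    for level in range(levels):
        n = 2 ** level
        for r in range(n):
            for c in range(n):
                yield level, r, c

def window_mask(xy, level, r, c):
    """Mask of descriptors whose normalized (x, y) in [0, 1] lies in the cell."""
    n = 2 ** level
    cols = np.minimum((xy[:, 0] * n).astype(int), n - 1)
    rows = np.minimum((xy[:, 1] * n).astype(int), n - 1)
    return (rows == r) & (cols == c)

def nn_distance(query, reference):
    """Mean squared distance from each query descriptor to its nearest reference."""
    if len(query) == 0 or len(reference) == 0:
        return 0.0  # assumption: an empty window contributes no distance
    d2 = ((query[:, None, :] - reference[None, :, :]) ** 2).sum(axis=2)
    return float(d2.min(axis=1).mean())

def window_features(desc, xy, class_desc, class_xy, levels=2):
    """One image-to-class nearest-neighbor distance per pyramid window."""
    feats = []
    for level, r, c in pyramid_windows(levels):
        q = desc[window_mask(xy, level, r, c)]
        ref = class_desc[window_mask(class_xy, level, r, c)]
        feats.append(nn_distance(q, ref))
    return np.array(feats)

def sigmoid_calibrate(score, a, b):
    """Platt-style sigmoid: P = 1 / (1 + exp(a * score + b))."""
    return 1.0 / (1.0 + np.exp(a * score + b))

def combine(scores, a, b, w):
    """Weighted sum of calibrated per-window scores (a, b, w are per window)."""
    return float(np.sum(w * sigmoid_calibrate(scores, a, b)))
```

With `levels=2` the sketch produces five windows (one at the coarsest level, four at the next), so `a`, `b`, and `w` would each hold five values per class. In the paper these are learned per class and per window; here they are plain inputs.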

Bibliographic details

Source
Machine Vision and Applications | 2014, Issue 4 | pp. 931-941 | 11 pages
Authors
Hong Cheng; Rongchao Yu; Zicheng Liu; Lu Yang; Xue-wen Chen
Author affiliations

University of Electronic Science and Technology of China, Chengdu, Sichuan, China;

University of Electronic Science and Technology of China, Chengdu, Sichuan, China;

Microsoft Research, Redmond, USA;

University of Electronic Science and Technology of China, Chengdu, Sichuan, China;

Wayne State University, 5057 Woodward Ave, Suite 3010, Detroit, MI, USA;

Indexed in: Science Citation Index (SCI); Engineering Index (EI)
Original format: PDF
Language: English
Keywords
Local kernels; Naive-Bayes Nearest Neighbor; Spatial pyramid matching; Object categorization;

Similar literature

1. Recognizing In The Depth: Selective 3D Spatial Pyramid Matching Kernel For Object And Scene Categorization [J]. Carolina Redondo-Cabrera, Roberto J. Lopez-Sastre, Javier Acevedo-Rodriguez. Image and Vision Computing, 2014, Issue 12.
2. Learning to Combine Kernels for Object Categorization [J]. Deyuan Zhang, Bingquan Liu, Chengjie Sun. Computer and Information Science, 2011, Issue 3.
3. A Pyramid Nearest Neighbor Search Kernel for object categorization [C]. Cheng, Hong, Yu, Rongchao, Liu, Zicheng. ICPR 2012; International Conference on Pattern Recognition, 2012.
4. Methods for efficient object categorization, detection, scene recognition, and image search [D]. Bergamo, Alessandro. 2014.
5. Improved Multiscale Entropy Technique with Nearest-Neighbor Moving-Average Kernel for Nonlinear and Nonstationary Short-Time Biomedical Signal Analysis [O]. S. P. Arunachalam, S. Kapa, S. K. Mulpuru. 2018.
6. Freak Descriptor With Spatial Pyramid Kernel For Scene Categorization [O]. Qiong Yao, Xiang Xu. 2015.
