Multi-level refinement enriched feature pyramid network for object detection

Aziz Lubna; Salam Md. Sah Bin Haji F. C.; Ayub Sara

首页> 外文期刊>Image and Vision Computing >Multi-level refinement enriched feature pyramid network for object detection

【24h】

Multi-level refinement enriched feature pyramid network for object detection

机译：多级细化丰富的特征金字塔网络用于对象检测

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Class Imbalance and scales imbalance are common in object detection. A class imbalance occurs due to insufficient inequality between the number of instances with respect to different classes, while an imbalance in scale occurs when object have different scales and a different number of examples of different scales. In order to solve the problem of scale variance (scale imbalance) and class imbalance together, we propose a simple and effective feature enhancement scheme that explicitly uses all information of a multi-level structure to generate a multilevel contextual features pyramid with multiple scales. We also introduce a cascaded refinement scheme that incorporates multi-scale contextual features into the Single Shot Detector (SSD) predictive layers to improve their distinctiveness for multi-scale detection. A stack of multi-scale contextual feature modules is used in a feature enhancement scheme to merge the multi-level and multi-scale features. Then we collect the equivalent scale features over the Multi-layer Feature Fusion (MLFF) unit to construct a feature pyramid in which each feature map is made up of layers from multiple levels. More robustness and contextual information are integrated into the pyramid through chain parallel pooling operation. To improve classification and regression, a cascaded refinement scheme is proposed that effectively captures a large amount of contextual information and refines the anchors to solve the class imbalance problem. The experiments are carried out on two benchmarks datasets: MS COCO and PASCAL VOC 07/12. Our proposed approach achieves state-of-the-art accuracy with an AP of 40.6 in the case of multi-scale inference on MS COCO Test-dev (input size 320 x 320). For 512 x 512 input on the MS COCO Test-dev, our approach leads in an absolute gain in precision of 1.8% compared to the best reported results of single-stage detector (AP: 45.7). (c) 2021 Elsevier B.V. All rights reserved.

机译：类别不平衡和尺度不平衡在对象检测中很常见。由于不同类别的情况之间的情况之间的不等式不等式而发生级别的不平衡，而当物体具有不同的尺度和不同比例的示例的不同数量的示例，则发生比例的不平衡。为了解决尺度方差（规模不平衡）和类不平衡的问题，我们提出了一种简单有效的特征增强方案，明确使用多级结构的所有信息来生成具有多个尺度的金字塔的多级上下文。我们还介绍了一种级联的细化方案，该方案将多尺度上下文特征结合到单次检测器（SSD）预测层中，以改善它们的多尺度检测的独特性。一堆多尺度上下文特征模块用于特征增强方案，以合并多级和多尺度特征。然后我们收集多层特征融合（MLFF）单元上的等效刻度特征，以构造一个特征金字塔，其中每个特征映射由来自多个级别的层组成。通过链并联池操作将更多的鲁棒性和上下文信息集成到金字塔中。为了提高分类和回归，提出了一种级联的细化方案，以有效地捕获大量的上下文信息并改进锚点以解决类别不平衡问题。实验是在两个基准数据集中进行：COCO和Pascal VOC 07/12。我们所提出的方法在MS Coco Test-Dev上的多尺寸推断的情况下，通过40.6的AP实现最先进的准确性（输入大小320 x 320）。对于MS Coco Test-Dev的512 x 512输入，与单级检测器的最佳报告结果相比，我们的方法以1.8％的精确增益为导致的1.8％（AP：45.7）。（c）2021 elestvier b.v.保留所有权利。

著录项

来源
《Image and Vision Computing》 |2021年第11期|104287.1-104287.11|共11页
作者
Aziz Lubna; Salam Md. Sah Bin Haji F. C.; Ayub Sara;
展开▼
作者单位

Univ Teknol Malaysia Sch Comp Div Artificial Intelligence Fac Engn Skudai 81310 Johor Malaysia|FICT BUITEMS Dept Comp Engn Lahore Pakistan;

Univ Teknol Malaysia Sch Comp Div Artificial Intelligence Fac Engn Skudai 81310 Johor Malaysia;

Univ Teknol Malaysia Dept Elect Engn Fac Engn Skudai 81310 Johor Malaysia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
CNN; Object detection; Chained parallel pooling; Computer vision; Feature pyramid;

机译：CNN;物体检测;链接并联池;计算机愿景;特征金字塔;

相似文献

外文文献
中文文献
专利

1. Residual feature pyramid networks for salient object detection [J] . Wang Ben, Chen Shuhan, Wang Jian, The Visual Computer . 2020,第9期

机译：用于突出物体检测的残差特征金字塔网络
2. RADet: Refine Feature Pyramid Network and Multi-Layer Attention Network for Arbitrary-Oriented Object Detection of Remote Sensing Images [J] . International journal of applied mechanics . 2020,第3期

机译：雷雾：优化特征金字塔网络和多层注意网络，用于遥感图像的任意面向对象检测
3. A single-shot multi-level feature reused neural network for object detection [J] . Wei Lixin, Cui Wei, Hu Ziyu, The Visual Computer . 2021,第1期

机译：单次多级别功能重复使用神经网络进行对象检测
4. Enriched Feature Guided Refinement Network for Object Detection [C] . Jing Nie, Rao Muhammad Anwer, Hisham Cholakkal, International Conference on Computer Vision . 2019

机译：用于对象检测的丰富的功能导向优化网络
5. Multi-Scale Object Detection in Aerial Images with Feature Pyramid Networks [D] . Bhattarai, Sujan 2018

机译：具有特征金字塔网络的航空影像多尺度目标检测
6. A Novel Pyramid Network with Feature Fusion and Disentanglement for Object Detection [O] . Guoyi Yu, You Wu, Jing Xiao, 2021

机译：一种新的金字塔网络具有对象检测的特征融合和解剖学
7. Small-Object Detection in UAV-Captured Images via Multi-Branch Parallel Feature Pyramid Networks [O] . Yingjie Liu, Fengbao Yang, Peng Hu 2020

机译：通过多分支并行特征金字塔网络在无人机捕获的图像中的小对象检测
8. Keypoint Density-Based Region Proposal for Fine-Grained Object Detection and Classification Using Regions with Convolutional Neural Network Features. [R] . Turner, J. T., Gupta, K., Morris, B., 2015

机译：基于关键点密度的区域提议，用于使用具有卷积神经网络特征的区域进行细粒度目标检测和分类。

Multi-level refinement enriched feature pyramid network for object detection

摘要

著录项

相似文献

相关主题

期刊订阅