首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Accelerated Deformable Part Models on GPUs
【24h】

Accelerated Deformable Part Models on GPUs

机译:GPU上的加速可变形零件模型

获取原文
获取原文并翻译 | 示例
       

摘要

Object detection is a fundamental challenge facing intelligent applications. Image processing is a promising approach to this end, but its computational cost is often a significant problem. This paper presents schemes for accelerating the deformable part models (DPM) on graphics processing units (GPUs). DPM is a well-known algorithm for image-based object detection, and it achieves high detection rates at the expense of computational cost. GPUs are massively parallel compute devices designed to accelerate data-parallel compute-intensive workload. According to an analysis of execution times, approximately 98 percent of DPM code exhibits loop processing, which means that DPM could be highly parallelized by GPUs. In this paper, we implement DPM on the GPU by exploiting multiple parallelization schemes. Results of an experimental evaluation of this GPU-accelerated DPM implementation demonstrate that the best scheme of GPU implementations using an NVIDIA GPU achieves a speed up of 8.6x over a naive CPU-based implementation.
机译:对象检测是智能应用程序面临的一项基本挑战。为此,图像处理是一种很有前途的方法,但是其计算成本通常是一个重大问题。本文提出了在图形处理单元(GPU)上加速可变形零件模型(DPM)的方案。 DPM是用于基于图像的对象检测的著名算法,它以计算成本为代价实现了很高的检测率。 GPU是大规模并行计算设备,旨在加速数据并行计算密集型工作负载。根据对执行时间的分析,大约98%的DPM代码具有循环处理功能,这意味着GPU可以高度并行化DPM。在本文中,我们通过利用多种并行化方案在GPU上实现DPM。对这种GPU加速的DPM实施进行实验评估的结果表明,使用NVIDIA GPU的最佳GPU实施方案比基于朴素的CPU的实施方案的速度提高了8.6倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号