首页> 外文会议>2016 IEEE Nordic Circuits and Systems Conference >OpenCL programmable exposed datapath high performance low-power image signal processor
【24h】

OpenCL programmable exposed datapath high performance low-power image signal processor

机译:OpenCL可编程裸露数据路径高性能低功耗图像信号处理器

获取原文
获取原文并翻译 | 示例

摘要

Sophisticated computational imaging algorithms require both high performance and good energy-efficiency when executed on mobile devices. Recent trend has been to exploit the abundant data-level parallelism found in general purpose programmable GPUs. However, for low-power mobile use cases, generic GPUs consume excessive amounts of power. This paper proposes a programmable computational imaging processor with 16-bit half-precision SIMD floating point vector processing capabilities combined with power efficiency of an exposed datapath. In comparison to traditional VLIW architectures with similar computational resources, the exposed datapath reduces the register file traffic and complexity. These and the specific optimizations enabled by the explicit programming model enable extremely good power-performance. When synthesized on a 28nm ASIC technology, the accelerator consumes 71mW of power while running a state-of-the-art denoising algorithm, and occupies only 0.2mm2 of chip area. For the algorithm, energy usage per frame is 7mJ, which is 10x less than the best found GPU-based implementation.
机译:在移动设备上执行时,复杂的计算成像算法既需要高性能又需要良好的能源效率。最近的趋势是利用在通用可编程GPU中发现的丰富的数据级并行性。但是,对于低功耗移动用例,通用GPU会消耗过多的电量。本文提出了一种可编程计算成像处理器,该处理器具有16位半精度SIMD浮点矢量处理能力,并具有裸露数据路径的功率效率。与具有类似计算资源的传统VLIW体系结构相比,公开的数据路径减少了寄存器文件的通信量和复杂性。通过显式编程模型实现的这些以及特定的优化可实现极佳的电源性能。当使用28nm ASIC技术进行合成时,该加速器在运行最新的去噪算法时消耗71mW的功率,并且仅占用0.2mm2的芯片面积。对于该算法,每帧的能耗为7mJ,比发现的最佳基于GPU的实现少10倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号