Source: 《计算机学报》 (Chinese Journal of Computers)

An Optimization Algorithm for Improving the Parallelism of SIMD Architectures Based on Bit-Width Control

         

Abstract

As SIMD functional units have become widely used as multimedia accelerators, exploiting this architecture effectively to optimize applications has become a focus of compiler optimization research. Typical SIMD architectures today provide different instruction versions of the same operation for different data bit widths; as the operand bit width increases, the number of operations that the corresponding SIMD instruction can complete at once decreases. Identifying the effective bit width of operands is therefore critical to the parallelism of operations within SIMD instructions during optimization. To address this parallelism problem, the paper proposes an optimization algorithm that analyzes the effective bits of operands and, on that basis, performs overflow control, thereby reducing the operands' dependence on wide data types. Experimental data show that the algorithm effectively increases the parallelism of optimized multimedia programs and achieves good speedups for them.

Although SIMD units have been widely used in different architecture designs, automatic optimizations for such architectures are not yet well developed. Since most optimizations for SIMD architectures are transplanted from traditional vectorization techniques, many special features of SIMD architectures, such as packed operations, have not been thoroughly considered. When operands are tightly packed within a register, there is no spare space to indicate overflow. To maintain the accuracy of automatically SIMDized programs, the operands must be unpacked to preserve enough space for interim overflow. However, such a strategy leads to great overhead. Moreover, the additional instructions for handling overflows can sometimes prevent other optimizations. In this paper, a new technique, BCSA (Bitwidth Controlled SIMD Arithmetic), is proposed to reduce the negative effects caused by interim overflow handling and to eliminate the interference of interim overflows. The algorithm is applied to the Berkeley multimedia benchmarks. The experimental results show that the algorithm can significantly improve the performance of multimedia applications.
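The trade-off described in the abstract can be illustrated with a small scalar model. The sketch below is not the paper's BCSA algorithm; it is a hypothetical example that assumes a 128-bit SIMD register and uses the averaging of two 8-bit pixels as the workload. The names `REGISTER_BITS`, `lanes`, `avg_widened`, and `avg_narrow` are introduced here for illustration only. It shows that a naive average produces a 9-bit interim sum, which forces 16-bit lanes, whereas a rewritten form keeps every interim value within 8 bits and so could stay in 8-bit lanes.

```python
REGISTER_BITS = 128  # assumed register width; not taken from the paper


def lanes(operand_bits, register_bits=REGISTER_BITS):
    """Number of operands one packed instruction handles at this lane width."""
    return register_bits // operand_bits


def avg_widened(a, b):
    """Average of two 8-bit values via a plain sum.

    The interim sum a + b can reach 510, which needs 9 bits, so a packed
    version would have to unpack the operands into 16-bit lanes.
    """
    interim = a + b
    return interim >> 1


def avg_narrow(a, b):
    """Same average, rewritten so no interim value exceeds 8 bits.

    Uses the identity floor((a + b) / 2) == (a & b) + ((a ^ b) >> 1).
    """
    common = a & b            # bits set in both operands, at most 255
    half_diff = (a ^ b) >> 1  # half of the differing bits, at most 127
    return common + half_diff


def max_interim_bits_widened():
    """Widest interim value of the naive sum over all 8-bit inputs."""
    return max((a + b).bit_length() for a in range(256) for b in range(256))


if __name__ == "__main__":
    print("8-bit lanes per register: ", lanes(8))
    print("16-bit lanes per register:", lanes(16))
    print("interim bits, naive sum:  ", max_interim_bits_widened())
```

Under these assumptions, keeping the computation in 8-bit lanes doubles the number of operands processed per instruction compared with the widened form, which is the kind of gain that bit-width analysis combined with overflow control aims for.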
