...
首页> 外文期刊>Future generation computer systems >FIPIP: A novel fine-grained parallel partition based intra-frame prediction on heterogeneous many-core systems
【24h】

FIPIP: A novel fine-grained parallel partition based intra-frame prediction on heterogeneous many-core systems

机译:FIPIP:异构多核系统上基于新颖细粒度并行分区的帧内预测

获取原文
获取原文并翻译 | 示例
           

摘要

Intra-frame prediction is an important time-consuming component of the widely used H.264/AVC encoder. To speed up prediction, one promising direction is to introduce parallelism and there have been many heterogeneous many-core based approaches proposed. But most of these approaches are limited by their use of highly irregular prediction formulas, which require significant amount of branch instructions. They only use coarse-grained parallel partition, which considers blocks or sub-region of images as parallel processing units. In this paper, by contrast, we propose a fine-grained intra-frame prediction approach based on parallel partition (FIPIP) and implement it on Graphics Processing Unit (GPU) based heterogeneous many-core systems. The approach is characterized by the following aspects. First, our approach takes individual pixels as parallel processing units, instead of blocks. Imposing pixel-level parallelism is capable of fully exploiting the computational power of heterogeneous GPU-based systems and hence tremendously reduces the encoding time. Second, we unify irregular prediction formulas in intra-frame prediction into a well-designed uniform one, and propose a table-lookup method to efficiently perform intra-frame prediction. Our formula can eliminate unnecessary branch instructions by using a unified predictor array, which improves the efficiency of the fine-grained parallel partition significantly. Third, two optimized encoding orders assisted by an improved combined frame strategy are adopted to implement multi-level parallelism. Finally, an efficient self-synchronizing method is realized for finegrained task scheduling on heterogeneous CPU-GPU architecture. We apply FIPIP to encode a set of benchmark videos under varying conditions and compare it with other popular intra-frame prediction methods. Results show that FIPIP outperforms existing state-of-the-art work with speedups factor of 2-6.
机译:帧内预测是广泛使用的H.264 / AVC编码器的重要耗时组件。为了加快预测速度,一个有希望的方向是引入并行性,并且提出了许多基于异构多核的方法。但是,这些方法大多数都受到高度不规则的预测公式的使用的限制,这些公式需要大量的分支指令。他们仅使用粗粒度并行分区,该分区将图像的块或子区域视为并行处理单元。相反,在本文中,我们提出了一种基于并行分区(FIPIP)的细粒度帧内预测方法,并将其实现在基于图形处理单元(GPU)的异构多核系统上。该方法的特征在于以下几个方面。首先,我们的方法将单个像素而不是块作为并行处理单元。施加像素级并行能力能够充分利用基于异构GPU的系统的计算能力,从而极大地减少了编码时间。其次,我们将帧内预测中的不规则预测公式统一为设计良好的统一公式,并提出一种表查找方法来有效执行帧内预测。通过使用统一的预测变量数组,我们的公式可以消除不必要的分支指令,从而显着提高了细粒度并行分区的效率。第三,采用改进的组合帧策略辅助的两个优化编码顺序来实现多级并行性。最后,针对异构CPU-GPU架构上的细粒度任务调度,实现了一种高效的自同步方法。我们应用FIPIP在不同条件下对一组基准视频进行编码,并将其与其他流行的帧内预测方法进行比较。结果表明,FIPIP以2-6的加速系数胜过现有的最新技术。

著录项

  • 来源
    《Future generation computer systems》 |2018年第1期|316-329|共14页
  • 作者单位

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China,Department Computer Science, College of Sciences, San Diego State University, San Diego, CA, United States;

    Department Computer Science, College of Sciences, San Diego State University, San Diego, CA, United States;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

    Department of Statistics, University of California, Los Angeles, Los Angeles, CA 90095, United States,Department of Cognitive Science, Johns Hopkins University, United States,Department of Computer Science, Johns Hopkins University, United States;

    School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Parallelism; Fine-grained partition; Intra-frame prediction; Fast mode decision; GPU; H.264/AVC;

    机译:并行性细粒度分区;帧内预测;快速模式决策;GPU;H.264 / AVC;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号