首页> 美国卫生研究院文献>other >Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation

【2h】

Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation

机译：GPU上的高效并行视频处理技术：从框架到实现

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Through reorganizing the execution order and optimizing the data structure, we proposed an efficient parallel framework for H.264/AVC encoder based on massively parallel architecture. We implemented the proposed framework by CUDA on NVIDIA's GPU. Not only the compute intensive components of the H.264 encoder are parallelized but also the control intensive components are realized effectively, such as CAVLC and deblocking filter. In addition, we proposed serial optimization methods, including the multiresolution multiwindow for motion estimation, multilevel parallel strategy to enhance the parallelism of intracoding as much as possible, component-based parallel CAVLC, and direction-priority deblocking filter. More than 96% of workload of H.264 encoder is offloaded to GPU. Experimental results show that the parallel implementation outperforms the serial program by 20 times of speedup ratio and satisfies the requirement of the real-time HD encoding of 30 fps. The loss of PSNR is from 0.14 dB to 0.77 dB, when keeping the same bitrate. Through the analysis to the kernels, we found that speedup ratios of the compute intensive algorithms are proportional with the computation power of the GPU. However, the performance of the control intensive parts (CAVLC) is much related to the memory bandwidth, which gives an insight for new architecture design.

机译：通过重组执行顺序并优化数据结构，我们提出了一种基于大规模并行架构的H.264 / AVC编码器高效并行框架。我们由CUDA在NVIDIA的GPU上实施了建议的框架。不仅使H.264编码器的计算密集型组件并行化，而且还可以有效地实现控制密集型组件，例如CAVLC和解块滤波器。此外，我们提出了串行优化方法，包括用于运动估计的多分辨率多窗口，尽可能提高帧内编码并行性的多级并行策略，基于组件的并行CAVLC以及方向优先级解块滤波器。 H.264编码器超过96％的工作负载已转移到GPU。实验结果表明，该并行实现的性能比串行程序高20倍，满足了30 fps实时高清编码的要求。当保持相同的比特率时，PSNR的损失为0.14 dB至0.77 dB。通过对内核的分析，我们发现计算密集型算法的加速比与GPU的计算能力成正比。但是，控制密集型部件（CAVLC）的性能与内存带宽密切相关，这为新的体系结构设计提供了见识。

著录项

期刊名称 other
作者
Huayou Su; Mei Wen; Nan Wu; Ju Ren; Chunyuan Zhang;
展开▼
作者单位

展开▼
年(卷),期 -1(2014),-1
年度 -1
页码 716020
总页数 19
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation [J] . HuayouSu, MeiWen, NanWu, ScientificWorldJournal . 2014,第3期

机译：GPU上有效的平行视频处理技术：从框架到实施
2. Efficient Implementation of Hyperspectral Anomaly Detection Techniques on GPUs and Multicore Processors [J] . Molero J.M., Garzon E.M., Garcia I., Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2014,第6期

机译：在GPU和多核处理器上高效实现高光谱异常检测技术
3. HGP4CNN: an efficient parallelization framework for training convolutional neural networks on modern GPUs [J] . Fu Hao, Tang Shanjiang, He Bingsheng, Journal of supercomputing . 2021,第11期

机译：HGP4CNN：用于培训现代GPU的卷积神经网络的有效平行化框架
4. A parallel implementation of IR video processing on a GPU [C] . Jarrah Amin, Mirzaei Golrokh, Majid Mohammad Wadood, IEEE International Midwest Symposium on Circuits and Systems . 2013

机译：在GPU上并行执行IR视频处理
5. A broadly applicable three-dimensional neuron reconstruction framework based on deformable models and software system with parallel GPU implementation [D] . Wang, Yu 2011

机译：基于可变形模型和具有平行GPU实现的可变形模型和软件系统的广泛适用的三维神经元重建框架
6. Corrigendum: Event- and Time-Driven Techniques Using Parallel CPU-GPU Co-processing for Spiking Neural Networks [O] . Francisco Naveros, Jesus A. Garrido, Richard R. Carrillo, 2018

机译：勘误：事件和时间驱动技术使用并行CPU-GPU协处理处理尖刺神经网络
7. Efficient Implementation of Hyperspectral Anomaly Detection Techniques on GPUs and Multicore Processors [O] . Molero Jose M., Garzon E.M., García I., 2014

机译：在GpU和多核处理器上高效实现高光谱异常检测技术
8. Efficient Parallel Algorithm for the Connected Component Problem and Its Implementation on the DPP84 (Delft Parallel Processor) [R] . Peper, F. 1986

机译：连通分量问题的高效并行算法及其在Dpp84上的实现（Delft并行处理器）

Efficient Parallel Video Processing Techniques on GPU: From Framework to Implementation

摘要

著录项

相似文献

相关主题

期刊订阅