Journal: Arabian Journal for Science and Engineering

A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Abstract

Video compression becomes necessary when a very large video has to be transmitted. The process starts with identifying the GoP (group of pictures), which depends on determining the I- (intra), B- (bidirectional) and P- (predicted) frames. Conventionally the GoP is fixed: consecutive frames are arranged in order according to the GoP size. B-frames require past and future frames to be buffered in memory, which consumes more computational time. These issues are addressed by an adaptive framework that determines frame types from matching criteria rather than from a fixed GoP. NSEW (North–South–East–West) affine translation (NAT) is proposed for replacing each B-frame with either an I- or a P-frame. Based on the resulting I- and P-frames, the proposed framework compresses the video sequence using affine motion-based free-form transformation and decompresses it using warping. A B-spline transformation is also applied at the local level, alongside the global affine transformation, to improve the subjective quality of the decompressed video sequence. The method was evaluated in terms of file size, computational time, peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM), and the results showed the proposed technique to be superior. The method was further evaluated with the affine motion parameters (AMP) optimized using nonlinear least squares, Broyden–Fletcher–Goldfarb–Shanno (BFGS) and limited-memory BFGS (LBFGS); it again outperformed conventional techniques, with an average PSNR of 38.98 dB with LBFGS. To further improve the subjective quality, affine B-spline-based motion estimation using LBFGS was implemented, and the average PSNR gain was observed to be 42.03 dB.
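To make two of the quantities in the abstract concrete, the following is a minimal Python sketch of a six-parameter global affine warp and of the PSNR measure used for evaluation. It is an illustration only and is not taken from the paper: the function names `psnr` and `warp_affine`, the parameter order, the nearest-neighbour sampling and the border clamping are all assumptions, and frames are represented as plain lists of rows of 8-bit grayscale values.

```python
import math


def psnr(reference, reconstructed, peak=255.0):
    """PSNR in dB between two equally sized grayscale frames (lists of rows)."""
    total, count = 0.0, 0
    for ref_row, rec_row in zip(reference, reconstructed):
        for r, d in zip(ref_row, rec_row):
            total += (r - d) ** 2
            count += 1
    mse = total / count
    if mse == 0:
        return math.inf  # identical frames: PSNR is unbounded
    return 10.0 * math.log10(peak * peak / mse)


def warp_affine(frame, params):
    """Predict a frame by sampling `frame` at affinely mapped coordinates.

    params = (a11, a12, a21, a22, tx, ty) maps output pixel (x, y) to the
    source position (a11*x + a12*y + tx, a21*x + a22*y + ty).
    Assumed for illustration: nearest-neighbour sampling, with source
    positions outside the frame clamped to the nearest border pixel.
    """
    a11, a12, a21, a22, tx, ty = params
    height, width = len(frame), len(frame[0])
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            sx = round(a11 * x + a12 * y + tx)
            sy = round(a21 * x + a22 * y + ty)
            sx = min(max(sx, 0), width - 1)
            sy = min(max(sy, 0), height - 1)
            row.append(frame[sy][sx])
        out.append(row)
    return out


# Hypothetical usage: score a candidate set of affine motion parameters.
previous = [[10, 20, 30, 40], [50, 60, 70, 80]]
current = [[20, 30, 40, 40], [60, 70, 80, 80]]
candidate = (1.0, 0.0, 0.0, 1.0, 1.0, 0.0)  # pure shift by one pixel in x
score = psnr(current, warp_affine(previous, candidate))
```

In a scheme like the one the abstract describes, an optimizer such as LBFGS would search over the six parameters to maximize a similarity score of this kind; the sketch only evaluates one given parameter set and does not perform that search or the local B-spline refinement.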