Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization

Recent video coding standards typically use rate-distortion optimization (RDO), which is essential for making appropriate mode decisions during the encoding process. The newest standard, High Efficiency Video Coding (HEVC), introduces complex encoding structures and strong dependencies between...


Bibliographic Details
Main Authors: Kais Rouis, Mohamed-Chaker Larabi, Jamel Belhadj Tahar
Format: Article
Language: English
Published: IEEE 2018-01-01
Series: IEEE Access
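Subjects: High efficiency video coding (HEVC); rate-distortion optimization; Lagrangian multiplier adjustment; perceptual features; frequency domain representation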
Online Access: https://ieeexplore.ieee.org/document/8374414/
id doaj-8dfd7ca4934d42349a82bff4f0957447
record_format Article
spelling doaj-8dfd7ca4934d42349a82bff4f0957447
2021-03-29T21:07:33Z
eng
IEEE
IEEE Access
2169-3536
2018-01-01
Volume 6, pages 33589-33603
DOI: 10.1109/ACCESS.2018.2843384
8374414
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization
Kais Rouis (https://orcid.org/0000-0002-1709-3683), National School of Engineering of Tunis, University of Tunis El Manar, Tunis, Tunisia
Mohamed-Chaker Larabi, XLIM UMR CNRS 7252, University of Poitiers, Poitiers, France
Jamel Belhadj Tahar, NOCCS Laboratory, National School of Engineering of Sousse, University of Sousse, Sousse, Tunisia
collection DOAJ
language English
format Article
sources DOAJ
author Kais Rouis
Mohamed-Chaker Larabi
Jamel Belhadj Tahar
title Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Recent video coding standards typically use rate-distortion optimization (RDO), which is essential for making appropriate mode decisions during the encoding process. The newest standard, High Efficiency Video Coding (HEVC), introduces complex encoding structures and strong dependencies between coding units. In particular, the Lagrangian multiplier is a primary factor in the RDO procedure: it directly affects rate-distortion (R-D) performance and is defined for an entire video frame. This paper proposes a novel approach for perceptually guiding the RDO process in HEVC. The reference encoder does not effectively consider the perceptual characteristics of the input video or the visual sensitivity of each coding tree unit (CTU) in a frame. Inspired by the mechanisms of the human visual system, the proposed solution is a CTU-level adjustment of the Lagrangian multiplier based on a set of complementary perceptual features. The proposed scheme accounts for the important visual information of a CTU and its temporal dependency on adjacent blocks. Feature extraction is implemented in the frequency domain using efficient spatio-temporal analysis. In our experiments, we adopted a perceptual mean squared error (MSE) metric and the structural similarity (SSIM) index. According to the perceptual MSE metric, the BD-rate savings (Bjøntegaard delta measurements) over the state-of-the-art HEVC software HM16.12 were 4.41% and 6.14% for the random access (RA) and low delay (LD) encoding settings, respectively. Using SSIM, the BD-rate savings reached 6.95% and 9.86% for the RA and LD settings, respectively. The proposed method further demonstrates superior R-D performance over a compared approach that adopts a similar scheme.
topic High efficiency video coding (HEVC)
rate-distortion optimization
Lagrangian multiplier adjustment
perceptual features
frequency domain representation
url https://ieeexplore.ieee.org/document/8374414/
_version_ 1724193499722547200
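
The abstract in this record describes rate-distortion optimization with a Lagrangian multiplier that is adjusted per coding tree unit (CTU). As a rough illustration only, the sketch below shows the standard cost J = D + lambda * R being minimized over candidate modes, with the frame-level lambda scaled by a per-CTU weight. The weight, the candidate values, and the names `Candidate`, `ctu_lambda`, and `best_mode` are hypothetical; the abstract does not say how the perceptual features map to a multiplier, so this is not the authors' method.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Candidate:
    """One candidate coding mode with its distortion and bit cost."""
    name: str
    distortion: float  # e.g. sum of squared errors for the block
    rate_bits: float   # bits needed to code the block in this mode


def ctu_lambda(frame_lambda: float, weight: float) -> float:
    """Scale the frame-level multiplier by a per-CTU weight.

    The weight is a stand-in (assumption) for whatever a perceptual
    model would produce; values below 1 favor lower distortion.
    """
    if frame_lambda <= 0 or weight <= 0:
        raise ValueError("frame_lambda and weight must be positive")
    return frame_lambda * weight


def best_mode(candidates: Sequence[Candidate], lam: float) -> Candidate:
    """Return the candidate that minimizes J = D + lambda * R."""
    return min(candidates, key=lambda c: c.distortion + lam * c.rate_bits)


# Made-up numbers, chosen only to show how lambda shifts the decision.
candidates = [
    Candidate("skip", distortion=900.0, rate_bits=2.0),
    Candidate("inter", distortion=400.0, rate_bits=40.0),
    Candidate("intra", distortion=250.0, rate_bits=120.0),
]

for weight in (0.1, 1.0, 2.0):
    lam = ctu_lambda(10.0, weight)
    print(weight, best_mode(candidates, lam).name)
```

With these illustrative numbers, a smaller weight lowers lambda and the decision moves toward modes that spend more bits for less distortion, while a larger weight moves it toward cheaper modes.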