Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization
Recent video coding standards typically use the Rate-distortion optimization (RDO) method, which is essential to appropriately perform mode decisions during encoding process. The newest standard high efficiency video coding (HEVC) introduces complex encoding structures and strong dependency between...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8374414/ |
id |
doaj-8dfd7ca4934d42349a82bff4f0957447 |
---|---|
record_format |
Article |
spelling |
doaj-8dfd7ca4934d42349a82bff4f09574472021-03-29T21:07:33ZengIEEEIEEE Access2169-35362018-01-016335893360310.1109/ACCESS.2018.28433848374414Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion OptimizationKais Rouis0https://orcid.org/0000-0002-1709-3683Mohamed-Chaker Larabi1Jamel Belhadj Tahar2National School of Engineering of Tunis, University of Tunis El Manar, Tunis, TunisiaXLIM UMR CNRS 7252, University of Poitiers, Poitiers, FranceNOCCS Laboratory, National School of Engineering of Sousse, University of Sousse, Sousse, TunisiaRecent video coding standards typically use the Rate-distortion optimization (RDO) method, which is essential to appropriately perform mode decisions during encoding process. The newest standard high efficiency video coding (HEVC) introduces complex encoding structures and strong dependency between coding units. Particularly, the Lagrangian multiplier is a primary factor in RDO procedure, which directly affects the rate-distortion (R-D) performance and is defined for an entire video frame. This paper proposes a novel approach for perceptually guiding the RDO process in HEVC. The reference encoder does not consider effectively the perceptual characteristics of the input video and further, the visual sensitivity of each coding tree unit (CTU) in a frame. Inspired by the mechanisms of the human visual system, the proposed solution is a CTU-level adjustment of Lagrangian value based on a set of complementary perceptual features. The proposed scheme concerns important visual information of a CTU and its temporal dependency with adjacent blocks. Feature extraction is implemented in the frequency domain using efficient spatio-temporal analysis. In our experiments, we opted a perceptual mean squared error (MSE) metric and structural similarity (SSIM) index. According to perceptual MSE metric, the BD-rate savings using the Bjontegaard delta measurements, were fairly convincing over the state-of-the-art HEVC software HM16.12; 4.41% and 6.14% for random access (RA) and low delay (LD) encoding settings, respectively. Using SSIM, the BD-Rate achieved 6.95% and 9.86% for RA and LD settings, respectively. The proposed method further demonstrates a superior R-D performance over a compared approach adopting a similar scheme.https://ieeexplore.ieee.org/document/8374414/High efficiency video coding (HEVC)rate-distortion optimizationLagrangian multiplier adjustmentperceptual featuresfrequency domain representation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kais Rouis Mohamed-Chaker Larabi Jamel Belhadj Tahar |
spellingShingle |
Kais Rouis Mohamed-Chaker Larabi Jamel Belhadj Tahar Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization IEEE Access High efficiency video coding (HEVC) rate-distortion optimization Lagrangian multiplier adjustment perceptual features frequency domain representation |
author_facet |
Kais Rouis Mohamed-Chaker Larabi Jamel Belhadj Tahar |
author_sort |
Kais Rouis |
title |
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization |
title_short |
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization |
title_full |
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization |
title_fullStr |
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization |
title_full_unstemmed |
Perceptually Adaptive Lagrangian Multiplier for HEVC Guided Rate-Distortion Optimization |
title_sort |
perceptually adaptive lagrangian multiplier for hevc guided rate-distortion optimization |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
Recent video coding standards typically use the Rate-distortion optimization (RDO) method, which is essential to appropriately perform mode decisions during encoding process. The newest standard high efficiency video coding (HEVC) introduces complex encoding structures and strong dependency between coding units. Particularly, the Lagrangian multiplier is a primary factor in RDO procedure, which directly affects the rate-distortion (R-D) performance and is defined for an entire video frame. This paper proposes a novel approach for perceptually guiding the RDO process in HEVC. The reference encoder does not consider effectively the perceptual characteristics of the input video and further, the visual sensitivity of each coding tree unit (CTU) in a frame. Inspired by the mechanisms of the human visual system, the proposed solution is a CTU-level adjustment of Lagrangian value based on a set of complementary perceptual features. The proposed scheme concerns important visual information of a CTU and its temporal dependency with adjacent blocks. Feature extraction is implemented in the frequency domain using efficient spatio-temporal analysis. In our experiments, we opted a perceptual mean squared error (MSE) metric and structural similarity (SSIM) index. According to perceptual MSE metric, the BD-rate savings using the Bjontegaard delta measurements, were fairly convincing over the state-of-the-art HEVC software HM16.12; 4.41% and 6.14% for random access (RA) and low delay (LD) encoding settings, respectively. Using SSIM, the BD-Rate achieved 6.95% and 9.86% for RA and LD settings, respectively. The proposed method further demonstrates a superior R-D performance over a compared approach adopting a similar scheme. |
topic |
High efficiency video coding (HEVC) rate-distortion optimization Lagrangian multiplier adjustment perceptual features frequency domain representation |
url |
https://ieeexplore.ieee.org/document/8374414/ |
work_keys_str_mv |
AT kaisrouis perceptuallyadaptivelagrangianmultiplierforhevcguidedratedistortionoptimization AT mohamedchakerlarabi perceptuallyadaptivelagrangianmultiplierforhevcguidedratedistortionoptimization AT jamelbelhadjtahar perceptuallyadaptivelagrangianmultiplierforhevcguidedratedistortionoptimization |
_version_ |
1724193499722547200 |