Context-adaptive based CU processing for 3D-HEVC.

The 3D High Efficiency Video Coding (3D-HEVC) standard aims to code 3D videos that usually contain multi-view texture videos and its corresponding depth information. It inherits the same quadtree prediction structure of HEVC to code both texture videos and depth maps. Each coding unit (CU) allows re...

Full description

Bibliographic Details
Main Authors: Liquan Shen, Ping An, Zhi Liu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5300175?pdf=render
id doaj-9f97234aea884869a1309ccf716d3fa7
record_format Article
spelling doaj-9f97234aea884869a1309ccf716d3fa72020-11-24T22:12:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01122e017101810.1371/journal.pone.0171018Context-adaptive based CU processing for 3D-HEVC.Liquan ShenPing AnZhi LiuThe 3D High Efficiency Video Coding (3D-HEVC) standard aims to code 3D videos that usually contain multi-view texture videos and its corresponding depth information. It inherits the same quadtree prediction structure of HEVC to code both texture videos and depth maps. Each coding unit (CU) allows recursively splitting into four equal sub-CUs. At each CU depth level, it enables 10 types of inter modes and 35 types of intra modes in inter frames. Furthermore, the inter-view prediction tools are applied to each view in the test model of 3D-HEVC (HTM), which uses variable size disparity-compensated prediction to exploit inter-view correlation within neighbor views. It also exploits redundancies between a texture video and its associated depth using inter-component coding tools. These achieve the highest coding efficiency to code 3D videos but require a very high computational complexity. In this paper, we propose a context-adaptive based fast CU processing algorithm to jointly optimize the most complex components of HTM including CU depth level decision, mode decision, motion estimation (ME) and disparity estimation (DE) processes. It is based on the hypothesis that the optimal CU depth level, prediction mode and motion vector of a CU are correlated with those from spatiotemporal, inter-view and inter-component neighboring CUs. We analyze the video content based on coding information from neighboring CUs and early predict each CU into one of five categories i.e., DE-omitted CU, ME-DE-omitted CU, SPLIT CU, Non-SPLIT CU and normal CU, and then each type of CU adaptively adopts different processing strategies. Experimental results show that the proposed algorithm saves 70% encoder runtime on average with only a 0.1% BD-rate increase on coded views and 0.8% BD-rate increase on synthesized views. Our algorithm outperforms the state-of-the-art algorithms in terms of coding time saving or with better RD performance.http://europepmc.org/articles/PMC5300175?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Liquan Shen
Ping An
Zhi Liu
spellingShingle Liquan Shen
Ping An
Zhi Liu
Context-adaptive based CU processing for 3D-HEVC.
PLoS ONE
author_facet Liquan Shen
Ping An
Zhi Liu
author_sort Liquan Shen
title Context-adaptive based CU processing for 3D-HEVC.
title_short Context-adaptive based CU processing for 3D-HEVC.
title_full Context-adaptive based CU processing for 3D-HEVC.
title_fullStr Context-adaptive based CU processing for 3D-HEVC.
title_full_unstemmed Context-adaptive based CU processing for 3D-HEVC.
title_sort context-adaptive based cu processing for 3d-hevc.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description The 3D High Efficiency Video Coding (3D-HEVC) standard aims to code 3D videos that usually contain multi-view texture videos and its corresponding depth information. It inherits the same quadtree prediction structure of HEVC to code both texture videos and depth maps. Each coding unit (CU) allows recursively splitting into four equal sub-CUs. At each CU depth level, it enables 10 types of inter modes and 35 types of intra modes in inter frames. Furthermore, the inter-view prediction tools are applied to each view in the test model of 3D-HEVC (HTM), which uses variable size disparity-compensated prediction to exploit inter-view correlation within neighbor views. It also exploits redundancies between a texture video and its associated depth using inter-component coding tools. These achieve the highest coding efficiency to code 3D videos but require a very high computational complexity. In this paper, we propose a context-adaptive based fast CU processing algorithm to jointly optimize the most complex components of HTM including CU depth level decision, mode decision, motion estimation (ME) and disparity estimation (DE) processes. It is based on the hypothesis that the optimal CU depth level, prediction mode and motion vector of a CU are correlated with those from spatiotemporal, inter-view and inter-component neighboring CUs. We analyze the video content based on coding information from neighboring CUs and early predict each CU into one of five categories i.e., DE-omitted CU, ME-DE-omitted CU, SPLIT CU, Non-SPLIT CU and normal CU, and then each type of CU adaptively adopts different processing strategies. Experimental results show that the proposed algorithm saves 70% encoder runtime on average with only a 0.1% BD-rate increase on coded views and 0.8% BD-rate increase on synthesized views. Our algorithm outperforms the state-of-the-art algorithms in terms of coding time saving or with better RD performance.
url http://europepmc.org/articles/PMC5300175?pdf=render
work_keys_str_mv AT liquanshen contextadaptivebasedcuprocessingfor3dhevc
AT pingan contextadaptivebasedcuprocessingfor3dhevc
AT zhiliu contextadaptivebasedcuprocessingfor3dhevc
_version_ 1725803837889970176