Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC

With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth...

Full description

Bibliographic Details
Main Authors: Jinchao Zhao, Yihan Wang, Qiuwen Zhang
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2020/8883214
id doaj-bb562422bae0454d9b745e132506b0c0
record_format Article
spelling doaj-bb562422bae0454d9b745e132506b0c02021-07-02T11:47:07ZengHindawi LimitedScientific Programming1058-92441875-919X2020-01-01202010.1155/2020/88832148883214Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVCJinchao Zhao0Yihan Wang1Qiuwen Zhang2College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaWith the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.http://dx.doi.org/10.1155/2020/8883214
collection DOAJ
language English
format Article
sources DOAJ
author Jinchao Zhao
Yihan Wang
Qiuwen Zhang
spellingShingle Jinchao Zhao
Yihan Wang
Qiuwen Zhang
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
Scientific Programming
author_facet Jinchao Zhao
Yihan Wang
Qiuwen Zhang
author_sort Jinchao Zhao
title Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_short Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_full Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_fullStr Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_full_unstemmed Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_sort adaptive cu split decision based on deep learning and multifeature fusion for h.266/vvc
publisher Hindawi Limited
series Scientific Programming
issn 1058-9244
1875-919X
publishDate 2020-01-01
description With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.
url http://dx.doi.org/10.1155/2020/8883214
work_keys_str_mv AT jinchaozhao adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc
AT yihanwang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc
AT qiuwenzhang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc
_version_ 1721330801306501120