Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC

With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth...

Full description

Bibliographic Details
Main Authors:	Jinchao Zhao, Yihan Wang, Qiuwen Zhang
Format:	Article
Language:	English
Published:	Hindawi Limited 2020-01-01
Series:	Scientific Programming
Online Access:	http://dx.doi.org/10.1155/2020/8883214

id	doaj-bb562422bae0454d9b745e132506b0c0
record_format	Article
spelling	doaj-bb562422bae0454d9b745e132506b0c02021-07-02T11:47:07ZengHindawi LimitedScientific Programming1058-92441875-919X2020-01-01202010.1155/2020/88832148883214Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVCJinchao Zhao0Yihan Wang1Qiuwen Zhang2College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaWith the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.http://dx.doi.org/10.1155/2020/8883214
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jinchao Zhao Yihan Wang Qiuwen Zhang
spellingShingle	Jinchao Zhao Yihan Wang Qiuwen Zhang Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC Scientific Programming
author_facet	Jinchao Zhao Yihan Wang Qiuwen Zhang
author_sort	Jinchao Zhao
title	Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_short	Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_full	Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_fullStr	Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_full_unstemmed	Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
title_sort	adaptive cu split decision based on deep learning and multifeature fusion for h.266/vvc
publisher	Hindawi Limited
series	Scientific Programming
issn	1058-9244 1875-919X
publishDate	2020-01-01
description	With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.
url	http://dx.doi.org/10.1155/2020/8883214
work_keys_str_mv	AT jinchaozhao adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc AT yihanwang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc AT qiuwenzhang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc
_version_	1721330801306501120

Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC

Similar Items