Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC
With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Scientific Programming |
Online Access: | http://dx.doi.org/10.1155/2020/8883214 |
id |
doaj-bb562422bae0454d9b745e132506b0c0 |
---|---|
record_format |
Article |
spelling |
doaj-bb562422bae0454d9b745e132506b0c02021-07-02T11:47:07ZengHindawi LimitedScientific Programming1058-92441875-919X2020-01-01202010.1155/2020/88832148883214Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVCJinchao Zhao0Yihan Wang1Qiuwen Zhang2College of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaCollege of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaWith the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC.http://dx.doi.org/10.1155/2020/8883214 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jinchao Zhao Yihan Wang Qiuwen Zhang |
spellingShingle |
Jinchao Zhao Yihan Wang Qiuwen Zhang Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC Scientific Programming |
author_facet |
Jinchao Zhao Yihan Wang Qiuwen Zhang |
author_sort |
Jinchao Zhao |
title |
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC |
title_short |
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC |
title_full |
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC |
title_fullStr |
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC |
title_full_unstemmed |
Adaptive CU Split Decision Based on Deep Learning and Multifeature Fusion for H.266/VVC |
title_sort |
adaptive cu split decision based on deep learning and multifeature fusion for h.266/vvc |
publisher |
Hindawi Limited |
series |
Scientific Programming |
issn |
1058-9244 1875-919X |
publishDate |
2020-01-01 |
description |
With the development of technology, the hardware requirement and expectations of user for visual enjoyment are getting higher and higher. The multitype tree (MTT) architecture is proposed by the Joint Video Experts Team (JVET). Therefore, it is necessary to determine not only coding unit (CU) depth but also its split mode in the H.266/Versatile Video Coding (H.266/VVC). Although H.266/VVC achieves significant coding performance on the basis of H.265/High Efficiency Video Coding (H.265/HEVC), it causes significantly coding complexity and increases coding time, where the most time-consuming part is traversal calculation rate-distortion (RD) of CU. To solve these problems, this paper proposes an adaptive CU split decision method based on deep learning and multifeature fusion. Firstly, we develop a texture classification model based on threshold to recognize complex and homogeneous CU. Secondly, if the complex CUs belong to edge CU, a Convolutional Neural Network (CNN) structure based on multifeature fusion is utilized to classify CU. Otherwise, an adaptive CNN structure is used to classify CUs. Finally, the division of CU is determined by the trained network and the parameters of CU. When the complex CUs are split, the above two CNN schemes can successfully process the training samples and terminate the rate-distortion optimization (RDO) calculation for some CUs. The experimental results indicate that the proposed method reduces the computational complexity and saves 39.39% encoding time, thereby achieving fast encoding in H.266/VVC. |
url |
http://dx.doi.org/10.1155/2020/8883214 |
work_keys_str_mv |
AT jinchaozhao adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc AT yihanwang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc AT qiuwenzhang adaptivecusplitdecisionbasedondeeplearningandmultifeaturefusionforh266vvc |
_version_ |
1721330801306501120 |