An efficient HEVC intra frame coding based on convolutional neural network

碩士 === 義守大學 === 電子工程學系 === 107 === Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding effic...

Full description

Bibliographic Details
Main Authors: Yueh-Ju Lu, 呂岳儒
Other Authors: Chou-Chen Wang
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/u373by
id ndltd-TW-107ISU05428011
record_format oai_dc
spelling ndltd-TW-107ISU054280112019-09-26T03:28:23Z http://ndltd.ncl.edu.tw/handle/u373by An efficient HEVC intra frame coding based on convolutional neural network 基於卷積神經網路之有效HEVC框內編碼 Yueh-Ju Lu 呂岳儒 碩士 義守大學 電子工程學系 107 Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In other words, HEVC can achieve high coding performance when the image contents match these 35 prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple CNN (S-CNN) to improve the encoding performance of HEVC. The S-CNN consists of super-resolution CNN (SRCNN) [7] and ResNet [8] with two layer networks. The S-CNN can precisely predict the residual information of coding tree unit (CTU) and achieve a better R-D performance for HEVC encoder. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images have high spatial correlation, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to four neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that both our proposed method and S-CNN can reach better R-D curves. On the other hand, although our proposed method increases 0.9% and loses 0.05 dB in average BD-BitRate and BD-PSNR as compared with S-CNN, respectively. However, we can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 13% on an average. Chou-Chen Wang 王周珍 2019 學位論文 ; thesis 74 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 義守大學 === 電子工程學系 === 107 === Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In other words, HEVC can achieve high coding performance when the image contents match these 35 prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple CNN (S-CNN) to improve the encoding performance of HEVC. The S-CNN consists of super-resolution CNN (SRCNN) [7] and ResNet [8] with two layer networks. The S-CNN can precisely predict the residual information of coding tree unit (CTU) and achieve a better R-D performance for HEVC encoder. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images have high spatial correlation, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to four neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that both our proposed method and S-CNN can reach better R-D curves. On the other hand, although our proposed method increases 0.9% and loses 0.05 dB in average BD-BitRate and BD-PSNR as compared with S-CNN, respectively. However, we can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 13% on an average.
author2 Chou-Chen Wang
author_facet Chou-Chen Wang
Yueh-Ju Lu
呂岳儒
author Yueh-Ju Lu
呂岳儒
spellingShingle Yueh-Ju Lu
呂岳儒
An efficient HEVC intra frame coding based on convolutional neural network
author_sort Yueh-Ju Lu
title An efficient HEVC intra frame coding based on convolutional neural network
title_short An efficient HEVC intra frame coding based on convolutional neural network
title_full An efficient HEVC intra frame coding based on convolutional neural network
title_fullStr An efficient HEVC intra frame coding based on convolutional neural network
title_full_unstemmed An efficient HEVC intra frame coding based on convolutional neural network
title_sort efficient hevc intra frame coding based on convolutional neural network
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/u373by
work_keys_str_mv AT yuehjulu anefficienthevcintraframecodingbasedonconvolutionalneuralnetwork
AT lǚyuèrú anefficienthevcintraframecodingbasedonconvolutionalneuralnetwork
AT yuehjulu jīyújuǎnjīshénjīngwǎnglùzhīyǒuxiàohevckuāngnèibiānmǎ
AT lǚyuèrú jīyújuǎnjīshénjīngwǎnglùzhīyǒuxiàohevckuāngnèibiānmǎ
AT yuehjulu efficienthevcintraframecodingbasedonconvolutionalneuralnetwork
AT lǚyuèrú efficienthevcintraframecodingbasedonconvolutionalneuralnetwork
_version_ 1719258257628856320