An efficient HEVC intra frame coding based on convolutional neural network
碩士 === 義守大學 === 電子工程學系 === 107 === Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding effic...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/u373by |
id |
ndltd-TW-107ISU05428011 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107ISU054280112019-09-26T03:28:23Z http://ndltd.ncl.edu.tw/handle/u373by An efficient HEVC intra frame coding based on convolutional neural network 基於卷積神經網路之有效HEVC框內編碼 Yueh-Ju Lu 呂岳儒 碩士 義守大學 電子工程學系 107 Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In other words, HEVC can achieve high coding performance when the image contents match these 35 prediction directions. In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple CNN (S-CNN) to improve the encoding performance of HEVC. The S-CNN consists of super-resolution CNN (SRCNN) [7] and ResNet [8] with two layer networks. The S-CNN can precisely predict the residual information of coding tree unit (CTU) and achieve a better R-D performance for HEVC encoder. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode. In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images have high spatial correlation, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to four neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that both our proposed method and S-CNN can reach better R-D curves. On the other hand, although our proposed method increases 0.9% and loses 0.05 dB in average BD-BitRate and BD-PSNR as compared with S-CNN, respectively. However, we can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 13% on an average. Chou-Chen Wang 王周珍 2019 學位論文 ; thesis 74 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 義守大學 === 電子工程學系 === 107 === Convolutional neural network (CNN) has been developed rapidly in deep learning areas, and has become a hot studying topic in image applications. High efficiency video coding (HEVC) is the newest video coding standard. The HEVC standard can achieve high coding efficiency with a lower bitrate for intra frame coding. However, it still needs many bits to finish best rate-distortion (R-D) curve. Since there are only 35 directions prediction modes provided in intra prediction module (IPM), HEVC occurs a large distortion when the image contents are out of these prediction directions. In other words, HEVC can achieve high coding performance when the image contents match these 35 prediction directions.
In order to obtain a better R-D curve, Zhang et al. [3] recently proposed a simple CNN (S-CNN) to improve the encoding performance of HEVC. The S-CNN consists of super-resolution CNN (SRCNN) [7] and ResNet [8] with two layer networks. The S-CNN can precisely predict the residual information of coding tree unit (CTU) and achieve a better R-D performance for HEVC encoder. However, S-CNN has to consume more time to encode intra frame coding since it needs to perform more CNN enhancement mode.
In order to further speed up S-CNN based intra frame coding, we propose an early termination algorithm to skip CNN. Because the natural images have high spatial correlation, we find the mean square errors (MSE) of reconstructed CTU exist high spatial correlation at HEVC encoder. Therefore, a dynamic threshold of MSE is set according to four neighboring encoded CTU blocks to evaluate whether the current reconstructed CTU is useful for the CNN enhancement mode. Simulation results show that both our proposed method and S-CNN can reach better R-D curves. On the other hand, although our proposed method increases 0.9% and loses 0.05 dB in average BD-BitRate and BD-PSNR as compared with S-CNN, respectively. However, we can achieve faster HEVC encoding process than S-CNN by reducing time increase ratio (TIR) about 13% on an average.
|
author2 |
Chou-Chen Wang |
author_facet |
Chou-Chen Wang Yueh-Ju Lu 呂岳儒 |
author |
Yueh-Ju Lu 呂岳儒 |
spellingShingle |
Yueh-Ju Lu 呂岳儒 An efficient HEVC intra frame coding based on convolutional neural network |
author_sort |
Yueh-Ju Lu |
title |
An efficient HEVC intra frame coding based on convolutional neural network |
title_short |
An efficient HEVC intra frame coding based on convolutional neural network |
title_full |
An efficient HEVC intra frame coding based on convolutional neural network |
title_fullStr |
An efficient HEVC intra frame coding based on convolutional neural network |
title_full_unstemmed |
An efficient HEVC intra frame coding based on convolutional neural network |
title_sort |
efficient hevc intra frame coding based on convolutional neural network |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/u373by |
work_keys_str_mv |
AT yuehjulu anefficienthevcintraframecodingbasedonconvolutionalneuralnetwork AT lǚyuèrú anefficienthevcintraframecodingbasedonconvolutionalneuralnetwork AT yuehjulu jīyújuǎnjīshénjīngwǎnglùzhīyǒuxiàohevckuāngnèibiānmǎ AT lǚyuèrú jīyújuǎnjīshénjīngwǎnglùzhīyǒuxiàohevckuāngnèibiānmǎ AT yuehjulu efficienthevcintraframecodingbasedonconvolutionalneuralnetwork AT lǚyuèrú efficienthevcintraframecodingbasedonconvolutionalneuralnetwork |
_version_ |
1719258257628856320 |