Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction

In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) a...

Full description

Bibliographic Details
Main Authors: Chuxi Yang, Yan Zhao, Shigang Wang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8692365/
id doaj-c6885c5fb33c45a2aae282f3c1b37cc9
record_format Article
spelling doaj-c6885c5fb33c45a2aae282f3c1b37cc92021-03-29T22:39:10ZengIEEEIEEE Access2169-35362019-01-017524845249710.1109/ACCESS.2019.29114038692365Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band PredictionChuxi Yang0https://orcid.org/0000-0002-8913-654XYan Zhao1Shigang Wang2https://orcid.org/0000-0002-3598-9352College of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaIn this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction.https://ieeexplore.ieee.org/document/8692365/Image codingneural networksdiscrete wavelet transformspredictive models
collection DOAJ
language English
format Article
sources DOAJ
author Chuxi Yang
Yan Zhao
Shigang Wang
spellingShingle Chuxi Yang
Yan Zhao
Shigang Wang
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
IEEE Access
Image coding
neural networks
discrete wavelet transforms
predictive models
author_facet Chuxi Yang
Yan Zhao
Shigang Wang
author_sort Chuxi Yang
title Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_short Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_full Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_fullStr Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_full_unstemmed Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_sort deep image compression in the wavelet transform domain based on high frequency sub-band prediction
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction.
topic Image coding
neural networks
discrete wavelet transforms
predictive models
url https://ieeexplore.ieee.org/document/8692365/
work_keys_str_mv AT chuxiyang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction
AT yanzhao deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction
AT shigangwang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction
_version_ 1724191201994735616