Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) a...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8692365/ |
id |
doaj-c6885c5fb33c45a2aae282f3c1b37cc9 |
---|---|
record_format |
Article |
spelling |
doaj-c6885c5fb33c45a2aae282f3c1b37cc92021-03-29T22:39:10ZengIEEEIEEE Access2169-35362019-01-017524845249710.1109/ACCESS.2019.29114038692365Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band PredictionChuxi Yang0https://orcid.org/0000-0002-8913-654XYan Zhao1Shigang Wang2https://orcid.org/0000-0002-3598-9352College of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaIn this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction.https://ieeexplore.ieee.org/document/8692365/Image codingneural networksdiscrete wavelet transformspredictive models |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chuxi Yang Yan Zhao Shigang Wang |
spellingShingle |
Chuxi Yang Yan Zhao Shigang Wang Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction IEEE Access Image coding neural networks discrete wavelet transforms predictive models |
author_facet |
Chuxi Yang Yan Zhao Shigang Wang |
author_sort |
Chuxi Yang |
title |
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction |
title_short |
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction |
title_full |
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction |
title_fullStr |
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction |
title_full_unstemmed |
Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction |
title_sort |
deep image compression in the wavelet transform domain based on high frequency sub-band prediction |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction. |
topic |
Image coding neural networks discrete wavelet transforms predictive models |
url |
https://ieeexplore.ieee.org/document/8692365/ |
work_keys_str_mv |
AT chuxiyang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction AT yanzhao deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction AT shigangwang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction |
_version_ |
1724191201994735616 |