Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction

In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) a...

Full description

Bibliographic Details
Main Authors:	Chuxi Yang, Yan Zhao, Shigang Wang
Format:	Article
Language:	English
Published:	IEEE 2019-01-01
Series:	IEEE Access
Subjects:	Image coding neural networks discrete wavelet transforms predictive models
Online Access:	https://ieeexplore.ieee.org/document/8692365/

id	doaj-c6885c5fb33c45a2aae282f3c1b37cc9
record_format	Article
spelling	doaj-c6885c5fb33c45a2aae282f3c1b37cc92021-03-29T22:39:10ZengIEEEIEEE Access2169-35362019-01-017524845249710.1109/ACCESS.2019.29114038692365Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band PredictionChuxi Yang0https://orcid.org/0000-0002-8913-654XYan Zhao1Shigang Wang2https://orcid.org/0000-0002-3598-9352College of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaCollege of Communication Engineering, Jilin University, Changchun, ChinaIn this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction.https://ieeexplore.ieee.org/document/8692365/Image codingneural networksdiscrete wavelet transformspredictive models
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Chuxi Yang Yan Zhao Shigang Wang
spellingShingle	Chuxi Yang Yan Zhao Shigang Wang Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction IEEE Access Image coding neural networks discrete wavelet transforms predictive models
author_facet	Chuxi Yang Yan Zhao Shigang Wang
author_sort	Chuxi Yang
title	Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_short	Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_full	Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_fullStr	Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_full_unstemmed	Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction
title_sort	deep image compression in the wavelet transform domain based on high frequency sub-band prediction
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2019-01-01
description	In this paper, we propose to use deep neural networks for image compression in the wavelet transform domain. When the input image is transformed from the spatial pixel domain to the wavelet transform domain, one low-frequency sub-band (LF sub-band) and three high-frequency sub-bands (HF sub-bands) are generated. Low-frequency sub-band is firstly used to predict each high-frequency sub-band to eliminate redundancy between the sub-bands, after which the sub-bands are fed into different auto-encoders to do the encoding. In order to further improve the compression efficiency, we use a conditional probability model to estimate the context-dependent prior probability of the encoded codes, which can be used for entropy coding. The entire training process is unsupervised, and the auto-encoders and the conditional probability model are trained jointly. The experimental results show that the proposed approach outperforms JPEG, JPEG2000, BPG, and some mainstream neural network-based image compression. Furthermore, it produces better visual quality with clearer details and textures because more high-frequency coefficients can be reserved, thanks to the high-frequency prediction.
topic	Image coding neural networks discrete wavelet transforms predictive models
url	https://ieeexplore.ieee.org/document/8692365/
work_keys_str_mv	AT chuxiyang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction AT yanzhao deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction AT shigangwang deepimagecompressioninthewavelettransformdomainbasedonhighfrequencysubbandprediction
_version_	1724191201994735616

Deep Image Compression in the Wavelet Transform Domain Based on High Frequency Sub-Band Prediction

Similar Items