Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network

碩士 === 國立臺灣科技大學 === 電機工程系 === 106 === Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG20...

Full description

Bibliographic Details
Main Authors: Ting-Xuan Wu, 吳亭萱
Other Authors: Jiann-Jone Chen
Format: Others
Language:zh-TW
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/bqk75g
id ndltd-TW-106NTUS5442117
record_format oai_dc
spelling ndltd-TW-106NTUS54421172019-05-16T00:59:40Z http://ndltd.ncl.edu.tw/handle/bqk75g Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network 基於LSTM/GRU結構循環神經網路圖像壓縮架構 Ting-Xuan Wu 吳亭萱 碩士 國立臺灣科技大學 電機工程系 106 Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG2000 utilize wavelet transform (WT) to provide multi-resolution compression. Both standards are efficient for current multimedia communication standards. In this research, we study how to utilize deep learning methods to compress image signals, which can provide comparable performances with DCT-based JPEG and WT-based JPEG2000. We proposed an auto encoder architecture for image compression based on a multi-layer recurrent convolutional neural network that comprises an encoder and a decoder sub models. For network nodes, we use long short-term memory network (LSTM) or GRU (Gated Recurrent Units) to enable efficient information delivery. In the training phase, both encoding and decoding procedures are trained together to reserve the most image feature information during compression and decompression. . In the testing phase, it executes encoding and decoding procedures separately to verify performances. Experiments showed that when BPP > 0.8, the PSNR and MS-SSIM (Multiscale Structure Similarity) performances of the proposed methods are better than those of JPEG. For high compression BPP < 0.8, the proposed method outperforms JPEG and others in MS-SSIM, which demonstrates better visual perception performance, i.e., sharper edge, rich textures and fewer artifacts. Jiann-Jone Chen 陳建中 2018 學位論文 ; thesis 65 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 電機工程系 === 106 === Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG2000 utilize wavelet transform (WT) to provide multi-resolution compression. Both standards are efficient for current multimedia communication standards. In this research, we study how to utilize deep learning methods to compress image signals, which can provide comparable performances with DCT-based JPEG and WT-based JPEG2000. We proposed an auto encoder architecture for image compression based on a multi-layer recurrent convolutional neural network that comprises an encoder and a decoder sub models. For network nodes, we use long short-term memory network (LSTM) or GRU (Gated Recurrent Units) to enable efficient information delivery. In the training phase, both encoding and decoding procedures are trained together to reserve the most image feature information during compression and decompression. . In the testing phase, it executes encoding and decoding procedures separately to verify performances. Experiments showed that when BPP > 0.8, the PSNR and MS-SSIM (Multiscale Structure Similarity) performances of the proposed methods are better than those of JPEG. For high compression BPP < 0.8, the proposed method outperforms JPEG and others in MS-SSIM, which demonstrates better visual perception performance, i.e., sharper edge, rich textures and fewer artifacts.
author2 Jiann-Jone Chen
author_facet Jiann-Jone Chen
Ting-Xuan Wu
吳亭萱
author Ting-Xuan Wu
吳亭萱
spellingShingle Ting-Xuan Wu
吳亭萱
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
author_sort Ting-Xuan Wu
title Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
title_short Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
title_full Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
title_fullStr Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
title_full_unstemmed Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
title_sort image compression using lstm/gru based recurrent convolution neural network
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/bqk75g
work_keys_str_mv AT tingxuanwu imagecompressionusinglstmgrubasedrecurrentconvolutionneuralnetwork
AT wútíngxuān imagecompressionusinglstmgrubasedrecurrentconvolutionneuralnetwork
AT tingxuanwu jīyúlstmgrujiégòuxúnhuánshénjīngwǎnglùtúxiàngyāsuōjiàgòu
AT wútíngxuān jīyúlstmgrujiégòuxúnhuánshénjīngwǎnglùtúxiàngyāsuōjiàgòu
_version_ 1719172432903798784