Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network
碩士 === 國立臺灣科技大學 === 電機工程系 === 106 === Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG20...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/bqk75g |
id |
ndltd-TW-106NTUS5442117 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-106NTUS54421172019-05-16T00:59:40Z http://ndltd.ncl.edu.tw/handle/bqk75g Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network 基於LSTM/GRU結構循環神經網路圖像壓縮架構 Ting-Xuan Wu 吳亭萱 碩士 國立臺灣科技大學 電機工程系 106 Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG2000 utilize wavelet transform (WT) to provide multi-resolution compression. Both standards are efficient for current multimedia communication standards. In this research, we study how to utilize deep learning methods to compress image signals, which can provide comparable performances with DCT-based JPEG and WT-based JPEG2000. We proposed an auto encoder architecture for image compression based on a multi-layer recurrent convolutional neural network that comprises an encoder and a decoder sub models. For network nodes, we use long short-term memory network (LSTM) or GRU (Gated Recurrent Units) to enable efficient information delivery. In the training phase, both encoding and decoding procedures are trained together to reserve the most image feature information during compression and decompression. . In the testing phase, it executes encoding and decoding procedures separately to verify performances. Experiments showed that when BPP > 0.8, the PSNR and MS-SSIM (Multiscale Structure Similarity) performances of the proposed methods are better than those of JPEG. For high compression BPP < 0.8, the proposed method outperforms JPEG and others in MS-SSIM, which demonstrates better visual perception performance, i.e., sharper edge, rich textures and fewer artifacts. Jiann-Jone Chen 陳建中 2018 學位論文 ; thesis 65 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 電機工程系 === 106 === Performances of Image/video encoding methods, such as robustness to error attack and compression ratio, are important for multimedia communication applications. The JPEG image compression standard adopts block-based discrete cosine transform (DCT) and the JPEG2000 utilize wavelet transform (WT) to provide multi-resolution compression. Both standards are efficient for current multimedia communication standards. In this research, we study how to utilize deep learning methods to compress image signals, which can provide comparable performances with DCT-based JPEG and WT-based JPEG2000.
We proposed an auto encoder architecture for image compression based on a multi-layer recurrent convolutional neural network that comprises an encoder and a decoder sub models. For network nodes, we use long short-term memory network (LSTM) or GRU (Gated Recurrent Units) to enable efficient information delivery. In the training phase, both encoding and decoding procedures are trained together to reserve the most image feature information during compression and decompression. . In the testing phase, it executes encoding and decoding procedures separately to verify performances. Experiments showed that when BPP > 0.8, the PSNR and MS-SSIM (Multiscale Structure Similarity) performances of the proposed methods are better than those of JPEG. For high compression BPP < 0.8, the proposed method outperforms JPEG and others in MS-SSIM, which demonstrates better visual perception performance, i.e., sharper edge, rich textures and fewer artifacts.
|
author2 |
Jiann-Jone Chen |
author_facet |
Jiann-Jone Chen Ting-Xuan Wu 吳亭萱 |
author |
Ting-Xuan Wu 吳亭萱 |
spellingShingle |
Ting-Xuan Wu 吳亭萱 Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
author_sort |
Ting-Xuan Wu |
title |
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
title_short |
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
title_full |
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
title_fullStr |
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
title_full_unstemmed |
Image Compression Using LSTM/GRU Based Recurrent Convolution Neural Network |
title_sort |
image compression using lstm/gru based recurrent convolution neural network |
publishDate |
2018 |
url |
http://ndltd.ncl.edu.tw/handle/bqk75g |
work_keys_str_mv |
AT tingxuanwu imagecompressionusinglstmgrubasedrecurrentconvolutionneuralnetwork AT wútíngxuān imagecompressionusinglstmgrubasedrecurrentconvolutionneuralnetwork AT tingxuanwu jīyúlstmgrujiégòuxúnhuánshénjīngwǎnglùtúxiàngyāsuōjiàgòu AT wútíngxuān jīyúlstmgrujiégòuxúnhuánshénjīngwǎnglùtúxiàngyāsuōjiàgòu |
_version_ |
1719172432903798784 |