Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization
博士 === 國立中正大學 === 資訊工程研究所 === 99 === Human perception tends to firstly pick attended regions, which correspond to prominent objects in an image. Visual attention region detection simulates the behavior of the human visual system (HVS) and detects regions of interest (ROIs) in the image. Artificial n...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/14706893808993433668 |
id |
ndltd-TW-099CCU00392062 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099CCU003920622016-04-13T04:17:18Z http://ndltd.ncl.edu.tw/handle/14706893808993433668 Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization 以視覺注意模型、類神經網路、及粒子群最佳化作影像/視訊解析度增強及融合 Chen, Hsuan-Ying 陳軒盈 博士 國立中正大學 資訊工程研究所 99 Human perception tends to firstly pick attended regions, which correspond to prominent objects in an image. Visual attention region detection simulates the behavior of the human visual system (HVS) and detects regions of interest (ROIs) in the image. Artificial neural network (ANN) is a biologically motivated learning machine, which simulates the structure and behavior of the nervous system. The mathematical model comprises individual processing units called neurons that resemble neural activity. ANN is a powerful tool for dealing with nonlinearities. Particle swarm optimization (PSO), a population-based optimization algorithm, belongs to an evolutionary computation paradigm. It is an off-line optimization and suitable for solving a complex problem at low cost. In this thesis, first, a visual attention region detection approach using low-level texture and object features is addressed. The new and improved (shifted) functions are proposed and used in both the proposed texture and object features to ensure that all attended pixels will be extracted. The proposed approach can generate high-quality spatial saliency maps in an effective manner. Second, a saliency-directed image interpolation approach using PSO is addressed. A block-based saliency map of an image to be interpolated is generated by the modified visual attention model in an effective manner. Then, based on the block-based saliency map, bilinear interpolation and PSO interpolation are employed for the pixels in “non-saliency” blocks and “saliency” blocks, respectively, to obtain the final interpolation results. Third, a saliency-directed color image interpolation approach using ANN and PSO is addressed. A high-quality saliency map of a color image to be interpolated is generated by the modified block-based visual attention model in an effective manner. Then, based on the saliency map, bilinear interpolation and ANN-PSO interpolation are employed for “non-saliency” blocks (non-ROIs) and “saliency” blocks (ROIs), respectively, to obtain the final color interpolation results. Fourth, a learning-based video super-resolution (SR) reconstruction approach using PSO is proposed. A motion-compensated volume containing five motion-compensated patches and the edge orientation of the volume are extracted and determined, respectively, for each pixel in the “central” reference low-resolution (LR) video frame. Then, the pixel values of the “central” reference high-resolution (HR) video frame are reconstructed by using the corresponding SR reconstruction filtering masks, based on the volume edge orientations and the coordinates of the pixels to be reconstructed. Fifth, a multispectral and multiresolution image fusion approach using PSO is proposed. The pixels of fused images in the training set are classified into several categories based on the characteristics of LR multispectral (MS) images. Then, the smooth parameters of spatial and spectral responses between the HR panchromatic (PAN) and LR MS images are determined by PSO. All the pixels within each category are normalized by its own smooth parameter. Leou, Jin-Jang 柳金章 2011 學位論文 ; thesis 173 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立中正大學 === 資訊工程研究所 === 99 === Human perception tends to firstly pick attended regions, which correspond to prominent objects in an image. Visual attention region detection simulates the behavior of the human visual system (HVS) and detects regions of interest (ROIs) in the image. Artificial neural network (ANN) is a biologically motivated learning machine, which simulates the structure and behavior of the nervous system. The mathematical model comprises individual processing units called neurons that resemble neural activity. ANN is a powerful tool for dealing with nonlinearities. Particle swarm optimization (PSO), a population-based optimization algorithm, belongs to an evolutionary computation paradigm. It is an off-line optimization and suitable for solving a complex problem at low cost.
In this thesis, first, a visual attention region detection approach using low-level texture and object features is addressed. The new and improved (shifted) functions are proposed and used in both the proposed texture and object features to ensure that all attended pixels will be extracted. The proposed approach can generate high-quality spatial saliency maps in an effective manner. Second, a saliency-directed image interpolation approach using PSO is addressed. A block-based saliency map of an image to be interpolated is generated by the modified visual attention model in an effective manner. Then, based on the block-based saliency map, bilinear interpolation and PSO interpolation are employed for the pixels in “non-saliency” blocks and “saliency” blocks, respectively, to obtain the final interpolation results. Third, a saliency-directed color image interpolation approach using ANN and PSO is addressed. A high-quality saliency map of a color image to be interpolated is generated by the modified block-based visual attention model in an effective manner. Then, based on the saliency map, bilinear interpolation and ANN-PSO interpolation are employed for “non-saliency” blocks (non-ROIs) and “saliency” blocks (ROIs), respectively, to obtain the final color interpolation results. Fourth, a learning-based video super-resolution (SR) reconstruction approach using PSO is proposed. A motion-compensated volume containing five motion-compensated patches and the edge orientation of the volume are extracted and determined, respectively, for each pixel in the “central” reference low-resolution (LR) video frame. Then, the pixel values of the “central” reference high-resolution (HR) video frame are reconstructed by using the corresponding SR reconstruction filtering masks, based on the volume edge orientations and the coordinates of the pixels to be reconstructed. Fifth, a multispectral and multiresolution image fusion approach using PSO is proposed. The pixels of fused images in the training set are classified into several categories based on the characteristics of LR multispectral (MS) images. Then, the smooth parameters of spatial and spectral responses between the HR panchromatic (PAN) and LR MS images are determined by PSO. All the pixels within each category are normalized by its own smooth parameter.
|
author2 |
Leou, Jin-Jang |
author_facet |
Leou, Jin-Jang Chen, Hsuan-Ying 陳軒盈 |
author |
Chen, Hsuan-Ying 陳軒盈 |
spellingShingle |
Chen, Hsuan-Ying 陳軒盈 Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
author_sort |
Chen, Hsuan-Ying |
title |
Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
title_short |
Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
title_full |
Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
title_fullStr |
Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
title_full_unstemmed |
Image/Video Super-Resolution and Fusion Using Visual Attention Model, Artificial Neural Network, and Particle Swarm Optimization |
title_sort |
image/video super-resolution and fusion using visual attention model, artificial neural network, and particle swarm optimization |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/14706893808993433668 |
work_keys_str_mv |
AT chenhsuanying imagevideosuperresolutionandfusionusingvisualattentionmodelartificialneuralnetworkandparticleswarmoptimization AT chénxuānyíng imagevideosuperresolutionandfusionusingvisualattentionmodelartificialneuralnetworkandparticleswarmoptimization AT chenhsuanying yǐshìjuézhùyìmóxínglèishénjīngwǎnglùjílìziqúnzuìjiāhuàzuòyǐngxiàngshìxùnjiěxīdùzēngqiángjírónghé AT chénxuānyíng yǐshìjuézhùyìmóxínglèishénjīngwǎnglùjílìziqúnzuìjiāhuàzuòyǐngxiàngshìxùnjiěxīdùzēngqiángjírónghé |
_version_ |
1718222253949190144 |