Image and video classification and image similarity measurement by learning sparse representation

Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of...

Full description

Bibliographic Details
Main Author: Guha, Tanaya
Language:English
Published: University of British Columbia 2013
Online Access:http://hdl.handle.net/2429/45122
id ndltd-UBC-oai-circle.library.ubc.ca-2429-45122
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-451222018-01-05T17:26:57Z Image and video classification and image similarity measurement by learning sparse representation Guha, Tanaya Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of bases it contains is the same or greater than the dimensionality of the given signal. Traditionally, the use of predefined dictionaries has been prevalent in sparse analysis. However, a more generalized approach is to learn the dictionary from the signal itself. Learnt dictionaries are known to outperform predefined dictionaries in several applications. This thesis explores the application of sparse representations of signals obtained by learning overcomplete dictionaries for three applications: 1) classification of images and videos, 2) measurement of similarity between two images, and 3) assessment of perceptual quality of an image. This thesis first capitalizes on the natural discriminative ability of sparse representations to develop efficient classification algorithms. The proposed algorithms are employed in image-based face recognition and video-based human action recognition. They are shown to perform better than the state-of-the-art. The thesis then studies how to obtain a good measure of similarity between two images. Despite the long history of image similarity evaluation, open issues still exist. These include the need of developing generic similarity measures that do not assume any prior knowledge of the task at hand or the data type. This thesis develops a generic image similarity measure based on learning sparse representations. Successful application of the proposed measure to clustering, retrieval and classification of different types of images is demonstrated. The thesis then examines a highly promising approach to assess the perceptual quality of an image. This approach involves comparing the structural information of a possibly distorted image with that in its reference image. The extraction of the structural information that is important to our visual system is a challenging task. A sparse representation-based image quality assessment approach is proposed to address this issue. When compared with seven existing metrics, our method performs the best in three databases and ranks among the top three in the remaining three databases. Applied Science, Faculty of Electrical and Computer Engineering, Department of Graduate 2013-09-24T15:58:43Z 2013-09-24T15:58:43Z 2013 2013-11 Text Thesis/Dissertation http://hdl.handle.net/2429/45122 eng Attribution-NonCommercial-NoDerivs 2.5 Canada http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ University of British Columbia
collection NDLTD
language English
sources NDLTD
description Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of bases it contains is the same or greater than the dimensionality of the given signal. Traditionally, the use of predefined dictionaries has been prevalent in sparse analysis. However, a more generalized approach is to learn the dictionary from the signal itself. Learnt dictionaries are known to outperform predefined dictionaries in several applications. This thesis explores the application of sparse representations of signals obtained by learning overcomplete dictionaries for three applications: 1) classification of images and videos, 2) measurement of similarity between two images, and 3) assessment of perceptual quality of an image. This thesis first capitalizes on the natural discriminative ability of sparse representations to develop efficient classification algorithms. The proposed algorithms are employed in image-based face recognition and video-based human action recognition. They are shown to perform better than the state-of-the-art. The thesis then studies how to obtain a good measure of similarity between two images. Despite the long history of image similarity evaluation, open issues still exist. These include the need of developing generic similarity measures that do not assume any prior knowledge of the task at hand or the data type. This thesis develops a generic image similarity measure based on learning sparse representations. Successful application of the proposed measure to clustering, retrieval and classification of different types of images is demonstrated. The thesis then examines a highly promising approach to assess the perceptual quality of an image. This approach involves comparing the structural information of a possibly distorted image with that in its reference image. The extraction of the structural information that is important to our visual system is a challenging task. A sparse representation-based image quality assessment approach is proposed to address this issue. When compared with seven existing metrics, our method performs the best in three databases and ranks among the top three in the remaining three databases. === Applied Science, Faculty of === Electrical and Computer Engineering, Department of === Graduate
author Guha, Tanaya
spellingShingle Guha, Tanaya
Image and video classification and image similarity measurement by learning sparse representation
author_facet Guha, Tanaya
author_sort Guha, Tanaya
title Image and video classification and image similarity measurement by learning sparse representation
title_short Image and video classification and image similarity measurement by learning sparse representation
title_full Image and video classification and image similarity measurement by learning sparse representation
title_fullStr Image and video classification and image similarity measurement by learning sparse representation
title_full_unstemmed Image and video classification and image similarity measurement by learning sparse representation
title_sort image and video classification and image similarity measurement by learning sparse representation
publisher University of British Columbia
publishDate 2013
url http://hdl.handle.net/2429/45122
work_keys_str_mv AT guhatanaya imageandvideoclassificationandimagesimilaritymeasurementbylearningsparserepresentation
_version_ 1718584001628733440