Image and video classification and image similarity measurement by learning sparse representation

Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of...

Full description

Bibliographic Details
Main Author:	Guha, Tanaya
Language:	English
Published:	University of British Columbia 2013
Online Access:	http://hdl.handle.net/2429/45122

id	ndltd-UBC-oai-circle.library.ubc.ca-2429-45122
record_format	oai_dc
spelling	ndltd-UBC-oai-circle.library.ubc.ca-2429-451222018-01-05T17:26:57Z Image and video classification and image similarity measurement by learning sparse representation Guha, Tanaya Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of bases it contains is the same or greater than the dimensionality of the given signal. Traditionally, the use of predefined dictionaries has been prevalent in sparse analysis. However, a more generalized approach is to learn the dictionary from the signal itself. Learnt dictionaries are known to outperform predefined dictionaries in several applications. This thesis explores the application of sparse representations of signals obtained by learning overcomplete dictionaries for three applications: 1) classification of images and videos, 2) measurement of similarity between two images, and 3) assessment of perceptual quality of an image. This thesis first capitalizes on the natural discriminative ability of sparse representations to develop efficient classification algorithms. The proposed algorithms are employed in image-based face recognition and video-based human action recognition. They are shown to perform better than the state-of-the-art. The thesis then studies how to obtain a good measure of similarity between two images. Despite the long history of image similarity evaluation, open issues still exist. These include the need of developing generic similarity measures that do not assume any prior knowledge of the task at hand or the data type. This thesis develops a generic image similarity measure based on learning sparse representations. Successful application of the proposed measure to clustering, retrieval and classification of different types of images is demonstrated. The thesis then examines a highly promising approach to assess the perceptual quality of an image. This approach involves comparing the structural information of a possibly distorted image with that in its reference image. The extraction of the structural information that is important to our visual system is a challenging task. A sparse representation-based image quality assessment approach is proposed to address this issue. When compared with seven existing metrics, our method performs the best in three databases and ranks among the top three in the remaining three databases. Applied Science, Faculty of Electrical and Computer Engineering, Department of Graduate 2013-09-24T15:58:43Z 2013-09-24T15:58:43Z 2013 2013-11 Text Thesis/Dissertation http://hdl.handle.net/2429/45122 eng Attribution-NonCommercial-NoDerivs 2.5 Canada http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ University of British Columbia
collection	NDLTD
language	English
sources	NDLTD
description	Sparse representation of signals has recently emerged as a major research area. It is well-known that many natural signals can be sparsely represented using a properly chosen dictionary (e.g. formed of wavelets bases). A dictionary could be complete or overcomplete depending on whether the number of bases it contains is the same or greater than the dimensionality of the given signal. Traditionally, the use of predefined dictionaries has been prevalent in sparse analysis. However, a more generalized approach is to learn the dictionary from the signal itself. Learnt dictionaries are known to outperform predefined dictionaries in several applications. This thesis explores the application of sparse representations of signals obtained by learning overcomplete dictionaries for three applications: 1) classification of images and videos, 2) measurement of similarity between two images, and 3) assessment of perceptual quality of an image. This thesis first capitalizes on the natural discriminative ability of sparse representations to develop efficient classification algorithms. The proposed algorithms are employed in image-based face recognition and video-based human action recognition. They are shown to perform better than the state-of-the-art. The thesis then studies how to obtain a good measure of similarity between two images. Despite the long history of image similarity evaluation, open issues still exist. These include the need of developing generic similarity measures that do not assume any prior knowledge of the task at hand or the data type. This thesis develops a generic image similarity measure based on learning sparse representations. Successful application of the proposed measure to clustering, retrieval and classification of different types of images is demonstrated. The thesis then examines a highly promising approach to assess the perceptual quality of an image. This approach involves comparing the structural information of a possibly distorted image with that in its reference image. The extraction of the structural information that is important to our visual system is a challenging task. A sparse representation-based image quality assessment approach is proposed to address this issue. When compared with seven existing metrics, our method performs the best in three databases and ranks among the top three in the remaining three databases. === Applied Science, Faculty of === Electrical and Computer Engineering, Department of === Graduate
author	Guha, Tanaya
spellingShingle	Guha, Tanaya Image and video classification and image similarity measurement by learning sparse representation
author_facet	Guha, Tanaya
author_sort	Guha, Tanaya
title	Image and video classification and image similarity measurement by learning sparse representation
title_short	Image and video classification and image similarity measurement by learning sparse representation
title_full	Image and video classification and image similarity measurement by learning sparse representation
title_fullStr	Image and video classification and image similarity measurement by learning sparse representation
title_full_unstemmed	Image and video classification and image similarity measurement by learning sparse representation
title_sort	image and video classification and image similarity measurement by learning sparse representation
publisher	University of British Columbia
publishDate	2013
url	http://hdl.handle.net/2429/45122
work_keys_str_mv	AT guhatanaya imageandvideoclassificationandimagesimilaritymeasurementbylearningsparserepresentation
_version_	1718584001628733440

Image and video classification and image similarity measurement by learning sparse representation

Similar Items