Data famine in big data era : machine learning algorithms for visual object recognition with limited training data

Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machi...

Full description

Bibliographic Details
Main Author:	Guo, Zhenyu
Language:	English
Published:	University of British Columbia 2014
Online Access:	http://hdl.handle.net/2429/46412

id	ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-46412
record_format	oai_dc
spelling	ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-464122014-04-16T03:42:40Z Data famine in big data era : machine learning algorithms for visual object recognition with limited training data Guo, Zhenyu Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty. We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts. 2014-04-14T21:06:56Z 2014-04-14T21:06:56Z 2014 2014-04-14 2014-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/46412 eng http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ Attribution-NonCommercial-NoDerivs 2.5 Canada University of British Columbia
collection	NDLTD
language	English
sources	NDLTD
description	Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty. We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts.
author	Guo, Zhenyu
spellingShingle	Guo, Zhenyu Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
author_facet	Guo, Zhenyu
author_sort	Guo, Zhenyu
title	Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_short	Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_full	Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_fullStr	Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_full_unstemmed	Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_sort	data famine in big data era : machine learning algorithms for visual object recognition with limited training data
publisher	University of British Columbia
publishDate	2014
url	http://hdl.handle.net/2429/46412
work_keys_str_mv	AT guozhenyu datafamineinbigdataeramachinelearningalgorithmsforvisualobjectrecognitionwithlimitedtrainingdata
_version_	1716662895429812224

Data famine in big data era : machine learning algorithms for visual object recognition with limited training data

Similar Items