Data famine in big data era : machine learning algorithms for visual object recognition with limited training data

Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machi...

Full description

Bibliographic Details
Main Author: Guo, Zhenyu
Language:English
Published: University of British Columbia 2014
Online Access:http://hdl.handle.net/2429/46412
id ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-46412
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-464122014-04-16T03:42:40Z Data famine in big data era : machine learning algorithms for visual object recognition with limited training data Guo, Zhenyu Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty. We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts. 2014-04-14T21:06:56Z 2014-04-14T21:06:56Z 2014 2014-04-14 2014-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/46412 eng http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ Attribution-NonCommercial-NoDerivs 2.5 Canada University of British Columbia
collection NDLTD
language English
sources NDLTD
description Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty. We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts.
author Guo, Zhenyu
spellingShingle Guo, Zhenyu
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
author_facet Guo, Zhenyu
author_sort Guo, Zhenyu
title Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_short Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_full Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_fullStr Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_full_unstemmed Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
title_sort data famine in big data era : machine learning algorithms for visual object recognition with limited training data
publisher University of British Columbia
publishDate 2014
url http://hdl.handle.net/2429/46412
work_keys_str_mv AT guozhenyu datafamineinbigdataeramachinelearningalgorithmsforvisualobjectrecognitionwithlimitedtrainingdata
_version_ 1716662895429812224