Data famine in big data era : machine learning algorithms for visual object recognition with limited training data
Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machi...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2014
|
Online Access: | http://hdl.handle.net/2429/46412 |
id |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-46412 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-464122014-04-16T03:42:40Z Data famine in big data era : machine learning algorithms for visual object recognition with limited training data Guo, Zhenyu Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty. We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts. 2014-04-14T21:06:56Z 2014-04-14T21:06:56Z 2014 2014-04-14 2014-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/46412 eng http://creativecommons.org/licenses/by-nc-nd/2.5/ca/ Attribution-NonCommercial-NoDerivs 2.5 Canada University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
Big data is an increasingly attractive concept in many fields both in academia and in industry. The increasing amount of information actually builds an illusion that we are going to have enough data to solve all the data driven problems. Unfortunately it is not true, especially for areas where machine learning methods are heavily employed, since sufficient high-quality training data doesn't necessarily come with the big data, and it is not easy or sometimes impossible to collect sufficient training samples, which most computational algorithms depend on. This thesis mainly focuses on dealing situations with limited training data in visual object recognition, by developing novel machine learning algorithms to overcome the limited training data difficulty.
We investigate three issues in object recognition involving limited training data: 1. one-shot object recognition, 2. cross-domain object recognition, and 3. object recognition for images with different picture styles. For Issue 1, we propose an unsupervised feature learning algorithm by constructing a deep structure of the stacked Hierarchical Dirichlet Process (HDP) auto-encoder, in order to extract "semantic" information from unlabeled source images. For Issue 2, we propose a Domain Adaptive Input-Output Kernel Learning algorithm to reduce the domain shifts in both input and output spaces. For Issue 3, we introduce a new problem involving images with different picture styles, successfully formulate the relationship between pixel mapping functions with gradient based image descriptors, and also propose a multiple kernel based algorithm to learn an optimal combination of basis pixel mapping functions to improve the recognition accuracy. For all the proposed algorithms, experimental results on publicly available data sets demonstrate the performance improvements over previous state-of-arts. |
author |
Guo, Zhenyu |
spellingShingle |
Guo, Zhenyu Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
author_facet |
Guo, Zhenyu |
author_sort |
Guo, Zhenyu |
title |
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
title_short |
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
title_full |
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
title_fullStr |
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
title_full_unstemmed |
Data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
title_sort |
data famine in big data era : machine learning algorithms for visual object recognition with limited training data |
publisher |
University of British Columbia |
publishDate |
2014 |
url |
http://hdl.handle.net/2429/46412 |
work_keys_str_mv |
AT guozhenyu datafamineinbigdataeramachinelearningalgorithmsforvisualobjectrecognitionwithlimitedtrainingdata |
_version_ |
1716662895429812224 |