A Learning-based Algorithm for Natural Scene Recognition
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 103 === Scene recognition is an important problem in many application areas of image and video processing. Scene recognition has a wide range of applications, such as object recognition and detection, content-based image indexing and retrieval and intelligent vehicle an...
Main Authors: | Thang Duy Dang, 鄧惟勝 |
---|---|
Other Authors: | Kai-Lung Hua |
Format: | Others |
Language: | en_US |
Published: | 2015 |
Online Access: | http://ndltd.ncl.edu.tw/handle/32032014120400974876 |
id |
ndltd-TW-103NTUS5392061 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTUS5392061 2016-11-06T04:19:40Z http://ndltd.ncl.edu.tw/handle/32032014120400974876 A Learning-based Algorithm for Natural Scene Recognition 一個基於學習演算法的自然影像辨識系統 Thang Duy Dang 鄧惟勝 Master's National Taiwan University of Science and Technology Department of Computer Science and Information Engineering 103 Scene recognition is an important problem in many application areas of image and video processing. It has a wide range of applications, such as object recognition and detection, content-based image indexing and retrieval, and intelligent vehicle and robot navigation. However, natural scene images tend to be very complex and difficult to analyze because of changes in illumination and transformation. In this thesis, we investigate a novel model for learning and recognizing natural scenes. This study proposes a new approach that combines locality-constrained sparse coding (LCSP), Spatial Pyramid Pooling, and a linear SVM in an end-to-end model. First, interest points in each training image are extracted with a local descriptor, dense SIFT, which represents local spatial information. These features are known as codewords, and each codeword is represented as part of a topic. We then employ the LCSP algorithm to learn the codeword distribution of those local features from the training dataset. Next, a modified Spatial Pyramid Pooling model encodes the spatial distribution of the local features; Spatial Pyramid Pooling has been remarkably successful in both scene and object recognition. In the testing stage, a linear SVM classifies the local features encoded by Spatial Pyramid Pooling. The new system achieved very competitive results, leading to state-of-the-art performance on several benchmarks. Kai-Lung Hua 花凱龍 2015 degree thesis ; thesis 45 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 103 === Scene recognition is an important problem in many application areas of image and video processing. It has a wide range of applications, such as object recognition and detection, content-based image indexing and retrieval, and intelligent vehicle and robot navigation. However, natural scene images tend to be very complex and difficult to analyze because of changes in illumination and transformation. In this thesis, we investigate a novel model for learning and recognizing natural scenes.
This study proposes a new approach that combines locality-constrained sparse coding (LCSP), Spatial Pyramid Pooling, and a linear SVM in an end-to-end model. First, interest points in each training image are extracted with a local descriptor, dense SIFT, which represents local spatial information. These features are known as codewords, and each codeword is represented as part of a topic. We then employ the LCSP algorithm to learn the codeword distribution of those local features from the training dataset. Next, a modified Spatial Pyramid Pooling model encodes the spatial distribution of the local features; Spatial Pyramid Pooling has been remarkably successful in both scene and object recognition. In the testing stage, a linear SVM classifies the local features encoded by Spatial Pyramid Pooling. The new system achieved very competitive results, leading to state-of-the-art performance on several benchmarks.
|
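The coding and pooling stages named in the description can be illustrated with a minimal sketch. It is not the thesis implementation: the record gives no details of LCSP or of the modified pyramid, so `knn_code` is a hypothetical, simplified stand-in (inverse-distance weights over the k nearest codewords), `spatial_pyramid_max_pool` assumes plain max pooling over 1x1, 2x2, and 4x4 grids, and the toy codebook, descriptors, and keypoint positions are invented for illustration. Dense SIFT extraction and the linear SVM are left out.

```python
import math


def knn_code(descriptor, codebook, k=2):
    """Encode one descriptor over its k nearest codewords.

    Simplified stand-in for locality-constrained coding (an assumption,
    not the LCSP algorithm of the thesis): weights are inverse distances
    normalised to sum to 1; every other entry of the code stays 0.
    """
    dists = [math.dist(descriptor, word) for word in codebook]
    nearest = sorted(range(len(codebook)), key=lambda i: dists[i])[:k]
    raw = {i: 1.0 / (dists[i] + 1e-9) for i in nearest}
    total = sum(raw.values())
    code = [0.0] * len(codebook)
    for i, weight in raw.items():
        code[i] = weight / total
    return code


def spatial_pyramid_max_pool(points, codes, width, height, levels=(1, 2, 4)):
    """Max-pool codes inside each cell of every grid level, then concatenate."""
    dim = len(codes[0])
    pooled = []
    for g in levels:
        # One running-maximum vector per cell of the g x g grid.
        cells = [[0.0] * dim for _ in range(g * g)]
        for (x, y), code in zip(points, codes):
            col = min(int(x * g / width), g - 1)
            row = min(int(y * g / height), g - 1)
            cell = cells[row * g + col]
            for j, value in enumerate(code):
                if value > cell[j]:
                    cell[j] = value
        for cell in cells:
            pooled.extend(cell)
    return pooled


# Toy data (hypothetical): 3 codewords, 3 local descriptors with positions.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 0.1), (0.0, 0.8)]
points = [(10, 10), (90, 20), (30, 80)]

codes = [knn_code(d, codebook) for d in descriptors]
feature = spatial_pyramid_max_pool(points, codes, width=100, height=100)
print(len(feature))  # (1 + 4 + 16) cells * 3 codewords = 63
```

The concatenated vector `feature` is what a linear classifier would receive as one image-level sample. Its length depends only on the codebook size and the grid levels, not on how many descriptors the image produced.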
author2 |
Kai-Lung Hua |
author_facet |
Kai-Lung Hua Thang Duy Dang 鄧惟勝 |
author |
Thang Duy Dang 鄧惟勝 |
spellingShingle |
Thang Duy Dang 鄧惟勝 A Learning-based Algorithm for Natural Scene Recognition |
author_sort |
Thang Duy Dang |
title |
A Learning-based Algorithm for Natural Scene Recognition |
title_short |
A Learning-based Algorithm for Natural Scene Recognition |
title_full |
A Learning-based Algorithm for Natural Scene Recognition |
title_fullStr |
A Learning-based Algorithm for Natural Scene Recognition |
title_full_unstemmed |
A Learning-based Algorithm for Natural Scene Recognition |
title_sort |
learning-based algorithm for natural scene recognition |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/32032014120400974876 |
work_keys_str_mv |
AT thangduydang alearningbasedalgorithmfornaturalscenerecognition AT dèngwéishèng alearningbasedalgorithmfornaturalscenerecognition AT thangduydang yīgèjīyúxuéxíyǎnsuànfǎdezìrányǐngxiàngbiànshíxìtǒng AT dèngwéishèng yīgèjīyúxuéxíyǎnsuànfǎdezìrányǐngxiàngbiànshíxìtǒng AT thangduydang learningbasedalgorithmfornaturalscenerecognition AT dèngwéishèng learningbasedalgorithmfornaturalscenerecognition |
_version_ |
1718391522907389952 |