Selection of relevant information to improve Image Classification using Bag of Visual Words

One of the main challenges in computer vision is image classification. Nowadays the number of images increases exponentially every day; therefore, it is important to classify them in a reliable way. The conventional image classification pipeline usually consists on extracting local image features, e...

Full description

Bibliographic Details
Main Author:	Eduardo Fidalgo Fernández
Format:	Article
Language:	English
Published:	Computer Vision Center Press 2018-03-01
Series:	ELCVIA Electronic Letters on Computer Vision and Image Analysis
Subjects:	Computer Vision Features and Image Descriptors Object Description and Recognition Suppor Vector Machines and Kernels Image Analysis and Processing Shape extraction and representation
Online Access:	https://elcvia.cvc.uab.es/article/view/1102

id	doaj-1bb61b106ae946ad85931ec663053466
record_format	Article
spelling	doaj-1bb61b106ae946ad85931ec6630534662021-09-18T12:38:28ZengComputer Vision Center PressELCVIA Electronic Letters on Computer Vision and Image Analysis1577-50972018-03-0116210.5565/rev/elcvia.1102317Selection of relevant information to improve Image Classification using Bag of Visual WordsEduardo Fidalgo Fernández0Universidad de LeónOne of the main challenges in computer vision is image classification. Nowadays the number of images increases exponentially every day; therefore, it is important to classify them in a reliable way. The conventional image classification pipeline usually consists on extracting local image features, encoding them as a feature vector and classify them using a previously created model. With regards to feature codification, the Bag of Words model and its extensions, such as pyramid matching and weighted schemes, have achieved quite good results and have become the state of the art methods. The process as mentioned above is not perfect and computers, as well as humans, may make mistakes in any of the steps, causing a performance drop in classification. Some of the primary sources of error on large-scale image classification are the presence of multiple objects in the image, small or very thin objects, incorrect annotations or fine-grained recognition tasks among others. Based on those problems and the steps of a typical image classification pipeline, the motivation of this PhD thesis was to provide some guidelines to improve the quality of the extracted features to obtain better classification results. The contributions of the PhD thesis demonstrated how a good feature selection can contribute to improving the fine-grained classification, and that there would even be no need to have a big training data set to learn the key features of each class and to predict with good results. https://elcvia.cvc.uab.es/article/view/1102Computer VisionFeatures and Image DescriptorsObject Description and RecognitionSuppor Vector Machines and KernelsImage Analysis and ProcessingShape extraction and representation
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Eduardo Fidalgo Fernández
spellingShingle	Eduardo Fidalgo Fernández Selection of relevant information to improve Image Classification using Bag of Visual Words ELCVIA Electronic Letters on Computer Vision and Image Analysis Computer Vision Features and Image Descriptors Object Description and Recognition Suppor Vector Machines and Kernels Image Analysis and Processing Shape extraction and representation
author_facet	Eduardo Fidalgo Fernández
author_sort	Eduardo Fidalgo Fernández
title	Selection of relevant information to improve Image Classification using Bag of Visual Words
title_short	Selection of relevant information to improve Image Classification using Bag of Visual Words
title_full	Selection of relevant information to improve Image Classification using Bag of Visual Words
title_fullStr	Selection of relevant information to improve Image Classification using Bag of Visual Words
title_full_unstemmed	Selection of relevant information to improve Image Classification using Bag of Visual Words
title_sort	selection of relevant information to improve image classification using bag of visual words
publisher	Computer Vision Center Press
series	ELCVIA Electronic Letters on Computer Vision and Image Analysis
issn	1577-5097
publishDate	2018-03-01
description	One of the main challenges in computer vision is image classification. Nowadays the number of images increases exponentially every day; therefore, it is important to classify them in a reliable way. The conventional image classification pipeline usually consists on extracting local image features, encoding them as a feature vector and classify them using a previously created model. With regards to feature codification, the Bag of Words model and its extensions, such as pyramid matching and weighted schemes, have achieved quite good results and have become the state of the art methods. The process as mentioned above is not perfect and computers, as well as humans, may make mistakes in any of the steps, causing a performance drop in classification. Some of the primary sources of error on large-scale image classification are the presence of multiple objects in the image, small or very thin objects, incorrect annotations or fine-grained recognition tasks among others. Based on those problems and the steps of a typical image classification pipeline, the motivation of this PhD thesis was to provide some guidelines to improve the quality of the extracted features to obtain better classification results. The contributions of the PhD thesis demonstrated how a good feature selection can contribute to improving the fine-grained classification, and that there would even be no need to have a big training data set to learn the key features of each class and to predict with good results.
topic	Computer Vision Features and Image Descriptors Object Description and Recognition Suppor Vector Machines and Kernels Image Analysis and Processing Shape extraction and representation
url	https://elcvia.cvc.uab.es/article/view/1102
work_keys_str_mv	AT eduardofidalgofernandez selectionofrelevantinformationtoimproveimageclassificationusingbagofvisualwords
_version_	1717376957221437440

Selection of relevant information to improve Image Classification using Bag of Visual Words

Similar Items