Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.

Computer Vision is a subfield within artificial intelligence that includes methods for acquisition, processing, analysis and understanding of images to get results in numerical or symbolic form. The information provided by the results is used to make decisions.We do not speak ofComputer Vision in is...

Full description

Bibliographic Details
Main Author:	Hernández Pérez, Bernard
Format:	Others
Language:	English
Published:	KTH, Skolan för datavetenskap och kommunikation (CSC) 2013
Subjects:	Computer Sciences Datavetenskap (datalogi)
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142486

id	ndltd-UPSALLA1-oai-DiVA.org-kth-142486
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-kth-1424862018-01-12T05:12:36ZMulti-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.engHernández Pérez, BernardKTH, Skolan för datavetenskap och kommunikation (CSC)2013Computer SciencesDatavetenskap (datalogi)Computer Vision is a subfield within artificial intelligence that includes methods for acquisition, processing, analysis and understanding of images to get results in numerical or symbolic form. The information provided by the results is used to make decisions.We do not speak ofComputer Vision in isolation, interaction with other fields is inevitable and deserve particular attention image processing, pattern recognition and Machine Learning. The main objective of this project is to analyze the behavior of visual feature extraction algorithms and their effectiveness in decision making. The detection of an object in an image, its classification and recognition are the type of decisions that are studied. Feature extraction algorithms are applied to attempt multi-view object recognition. To tackle this problem a new approach is proposed. This approach creates a graph-based representation of the object using cluster analysis recursively. The nodes of the graph represent the main physical components that make up the object. Support Vector Machines (SVMs) are used to classify the nodes, thus classes are classified independently. Finally, the graph-based representation of the object is exploited to drop the assumption of independence and find relations between classes using Structured Output-Support Vector Machines (SO-SVMs). Datorseende är ett delområde inom artificiell intelligens som innehåller metoder för förvärv, bildbehandling, analys och förståelse av bilder för att få resultat i numerisk eller symbolisk form. Informationen som resultatet ger används för att fatta beslut. Vi kan inte tala om visioner i isolering, samspel med andra områden är oundviklig och förtjänar särskild uppmärksamhet bildbehandling, mönsterigenkänning och maskininlärning. Huvudsyftet med detta projekt är att analysera beteendet hos visuella algoritmers särdragsextraktion och deras effektivitet i beslutsfattande. Upptäckten av ett objekt i en bild, dess klassificering och erkännande är den typ av beslut som studeras. Algoritmers särdragsextraktion tillämpas för att försöka erkänna objekts mång-vy. För att tackla detta problem har ett nytt tillvägagångssätt föreslgits. Detta tillvägagångssätt skapar en grafbaserad representation av objektet med hjälp av rekursiv klusteranalys. Noderna i grafen representerar de viktigaste fysiska komponenterna i objektet. Support Vector Machines (SVMs) används för att klassificera noderna, dessa klasser klassificeras självständigt. Slutligen, grafbaserad representation av objekt utnyttjas för att släppa antagandet om oberoende och hitta relationer mellan klasser genom att använda Structured Output - Support Vector Machines (SOSVMs). Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142486application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Computer Sciences Datavetenskap (datalogi)
spellingShingle	Computer Sciences Datavetenskap (datalogi) Hernández Pérez, Bernard Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
description	Computer Vision is a subfield within artificial intelligence that includes methods for acquisition, processing, analysis and understanding of images to get results in numerical or symbolic form. The information provided by the results is used to make decisions.We do not speak ofComputer Vision in isolation, interaction with other fields is inevitable and deserve particular attention image processing, pattern recognition and Machine Learning. The main objective of this project is to analyze the behavior of visual feature extraction algorithms and their effectiveness in decision making. The detection of an object in an image, its classification and recognition are the type of decisions that are studied. Feature extraction algorithms are applied to attempt multi-view object recognition. To tackle this problem a new approach is proposed. This approach creates a graph-based representation of the object using cluster analysis recursively. The nodes of the graph represent the main physical components that make up the object. Support Vector Machines (SVMs) are used to classify the nodes, thus classes are classified independently. Finally, the graph-based representation of the object is exploited to drop the assumption of independence and find relations between classes using Structured Output-Support Vector Machines (SO-SVMs). === Datorseende är ett delområde inom artificiell intelligens som innehåller metoder för förvärv, bildbehandling, analys och förståelse av bilder för att få resultat i numerisk eller symbolisk form. Informationen som resultatet ger används för att fatta beslut. Vi kan inte tala om visioner i isolering, samspel med andra områden är oundviklig och förtjänar särskild uppmärksamhet bildbehandling, mönsterigenkänning och maskininlärning. Huvudsyftet med detta projekt är att analysera beteendet hos visuella algoritmers särdragsextraktion och deras effektivitet i beslutsfattande. Upptäckten av ett objekt i en bild, dess klassificering och erkännande är den typ av beslut som studeras. Algoritmers särdragsextraktion tillämpas för att försöka erkänna objekts mång-vy. För att tackla detta problem har ett nytt tillvägagångssätt föreslgits. Detta tillvägagångssätt skapar en grafbaserad representation av objektet med hjälp av rekursiv klusteranalys. Noderna i grafen representerar de viktigaste fysiska komponenterna i objektet. Support Vector Machines (SVMs) används för att klassificera noderna, dessa klasser klassificeras självständigt. Slutligen, grafbaserad representation av objekt utnyttjas för att släppa antagandet om oberoende och hitta relationer mellan klasser genom att använda Structured Output - Support Vector Machines (SOSVMs).
author	Hernández Pérez, Bernard
author_facet	Hernández Pérez, Bernard
author_sort	Hernández Pérez, Bernard
title	Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
title_short	Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
title_full	Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
title_fullStr	Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
title_full_unstemmed	Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.
title_sort	multi-view object recognition and classification. graph-basedrepresentation of visual features and structured learning andprediction.
publisher	KTH, Skolan för datavetenskap och kommunikation (CSC)
publishDate	2013
url	http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142486
work_keys_str_mv	AT hernandezperezbernard multiviewobjectrecognitionandclassificationgraphbasedrepresentationofvisualfeaturesandstructuredlearningandprediction
_version_	1718606359069458432

Multi-View Object Recognition and Classification. Graph-BasedRepresentation of Visual Features and Structured Learning andPrediction.

Similar Items