LEARNING A VISUALFORWARD MODEL

Abstract Internal forward models are aimed to provide the system with the prediction of changes in sensory observations as the consequent of its own actions. For the special case where the sensed information is in the form of the camera images, the model is called visual forward model. Images are on...

Full description

Bibliographic Details
Main Author: GHADIRZADEH, ALI
Format: Others
Language:English
Published: KTH, Skolan för datavetenskap och kommunikation (CSC) 2013
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142034
id ndltd-UPSALLA1-oai-DiVA.org-kth-142034
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-1420342018-01-12T05:12:52ZLEARNING A VISUALFORWARD MODELengGHADIRZADEH, ALIKTH, Skolan för datavetenskap och kommunikation (CSC)2013Computer SciencesDatavetenskap (datalogi)Abstract Internal forward models are aimed to provide the system with the prediction of changes in sensory observations as the consequent of its own actions. For the special case where the sensed information is in the form of the camera images, the model is called visual forward model. Images are one of the richest resources of data and the ability to predict the sensory camera images, enables the robots to do more autonomous and intelligent tasks. Most of actions performed by robots lead to outcomes which are appearing in the vision system. Therefor the capability to predict these outcomes in the form of images helps the robot to execute better long- term plans. That is why the visual forward models are of particular importance. The main challenges regarding the construction of the visual forward models are the high amount of image data to be predicted and the degrees of freedom of the robot's action which causes the complexities to grow rapidly. In this work, we have investigated dierent methods to construct the visual forward models for a robotic camera head setup. The forward model explores the contin- gencies between the movements in the robot's neck and eye joints and the resulting changes in the camera images. Four dierent methods to construct the visual for- ward models are introduced and implemented. Learning of the forward models in these methods is based on linear interpolation, radial basis function networks or Gaussian processes given the correspondences between the successive frames ex- tracted by the use of SURF descriptors or constructions of so-called cumulator units. To examine the performance of the proposed methods, two dierent types of experiments are designed with the dierence that in the rst experiment, depth information is not relevant while in the second one it is. Our experimental results show the success of the introduced methods in the construction of the visual forward models also provide the weak and strong aspects of each method. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142034application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Computer Sciences
Datavetenskap (datalogi)
spellingShingle Computer Sciences
Datavetenskap (datalogi)
GHADIRZADEH, ALI
LEARNING A VISUALFORWARD MODEL
description Abstract Internal forward models are aimed to provide the system with the prediction of changes in sensory observations as the consequent of its own actions. For the special case where the sensed information is in the form of the camera images, the model is called visual forward model. Images are one of the richest resources of data and the ability to predict the sensory camera images, enables the robots to do more autonomous and intelligent tasks. Most of actions performed by robots lead to outcomes which are appearing in the vision system. Therefor the capability to predict these outcomes in the form of images helps the robot to execute better long- term plans. That is why the visual forward models are of particular importance. The main challenges regarding the construction of the visual forward models are the high amount of image data to be predicted and the degrees of freedom of the robot's action which causes the complexities to grow rapidly. In this work, we have investigated dierent methods to construct the visual forward models for a robotic camera head setup. The forward model explores the contin- gencies between the movements in the robot's neck and eye joints and the resulting changes in the camera images. Four dierent methods to construct the visual for- ward models are introduced and implemented. Learning of the forward models in these methods is based on linear interpolation, radial basis function networks or Gaussian processes given the correspondences between the successive frames ex- tracted by the use of SURF descriptors or constructions of so-called cumulator units. To examine the performance of the proposed methods, two dierent types of experiments are designed with the dierence that in the rst experiment, depth information is not relevant while in the second one it is. Our experimental results show the success of the introduced methods in the construction of the visual forward models also provide the weak and strong aspects of each method.
author GHADIRZADEH, ALI
author_facet GHADIRZADEH, ALI
author_sort GHADIRZADEH, ALI
title LEARNING A VISUALFORWARD MODEL
title_short LEARNING A VISUALFORWARD MODEL
title_full LEARNING A VISUALFORWARD MODEL
title_fullStr LEARNING A VISUALFORWARD MODEL
title_full_unstemmed LEARNING A VISUALFORWARD MODEL
title_sort learning a visualforward model
publisher KTH, Skolan för datavetenskap och kommunikation (CSC)
publishDate 2013
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-142034
work_keys_str_mv AT ghadirzadehali learningavisualforwardmodel
_version_ 1718606336314310656