Compact Representations and Multi-cue Integration for Robotics

This thesis presents methods useful in a bin picking application, such as detection and representation of local features, pose estimation and multi-cue integration. The scene tensor is a representation of multiple line or edge segments and was first introduced by Nordberg in [30]. A method for estim...

Full description

Bibliographic Details
Main Author: Söderberg, Robert
Format: Others
Language:English
Published: Linköpings universitet, Bildbehandling 2005
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5574
http://nbn-resolving.de/urn:isbn:91-85299-36-7
id ndltd-UPSALLA1-oai-DiVA.org-liu-5574
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-55742013-01-08T13:10:22ZCompact Representations and Multi-cue Integration for RoboticsengSöderberg, RobertLinköpings universitet, BildbehandlingLinköpings universitet, Tekniska högskolanInstitutionen för systemteknik2005Electrical engineeringbin picking applicationdetection and representation of local featurespose estimationmulti-cue integrationElektroteknikElectrical engineeringElektroteknikThis thesis presents methods useful in a bin picking application, such as detection and representation of local features, pose estimation and multi-cue integration. The scene tensor is a representation of multiple line or edge segments and was first introduced by Nordberg in [30]. A method for estimating scene tensors from gray-scale images is presented. The method is based on orientation tensors, where the scene tensor can be estimated by correlations of the elements in the orientation tensor with a number of 1D filters. Mechanisms for analyzing the scene tensor are described and an algorithm for detecting interest points and estimating feature parameters is presented. It is shown that the algorithm works on a wide spectrum of images with good result. Representations that are invariant with respect to a set of transformations are useful in many applications, such as pose estimation, tracking and wide baseline stereo. The scene tensor itself is not invariant and three different methods for implementing an invariant representation based on the scene tensor is presented. One is based on a non-linear transformation of the scene tensor and is invariant to perspective transformations. Two versions of a tensor doublet is presented, which is based on a geometry of two interest points and is invariant to translation, rotation and scaling. The tensor doublet is used in a framework for view centered pose estimation of 3D objects. It is shown that the pose estimation algorithm has good performance even though the object is occluded and has a different scale compared to the training situation. An industrial implementation of a bin picking application have to cope with several different types of objects. All pose estimation algorithms use some kind of model and there is yet no model that can cope with all kinds of situations and objects. This thesis presents a method for integrating cues from several pose estimation algorithms for increasing the system stability. It is also shown that the same framework can also be used for increasing the accuracy of the system by using cues from several views of the object. An extensive test with several different objects, lighting conditions and backgrounds shows that multi-cue integration makes the system more robust and increases the accuracy. Finally, a system for bin picking is presented, built from the previous parts of this thesis. An eye in hand setup is used with a standard industrial robot arm. It is shown that the system works for real bin-picking situations with a positioning error below 1 mm and an orientation error below 1o degree for most of the different situations. Report code: LiU-TEK-LIC-2005:15.Licentiate thesis, monographinfo:eu-repo/semantics/masterThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5574urn:isbn:91-85299-36-7Linköping Studies in Science and Technology. Thesis, 0280-7971 ; 1160 ; application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical engineering
bin picking application
detection and representation of local features
pose estimation
multi-cue integration
Elektroteknik
Electrical engineering
Elektroteknik
spellingShingle Electrical engineering
bin picking application
detection and representation of local features
pose estimation
multi-cue integration
Elektroteknik
Electrical engineering
Elektroteknik
Söderberg, Robert
Compact Representations and Multi-cue Integration for Robotics
description This thesis presents methods useful in a bin picking application, such as detection and representation of local features, pose estimation and multi-cue integration. The scene tensor is a representation of multiple line or edge segments and was first introduced by Nordberg in [30]. A method for estimating scene tensors from gray-scale images is presented. The method is based on orientation tensors, where the scene tensor can be estimated by correlations of the elements in the orientation tensor with a number of 1D filters. Mechanisms for analyzing the scene tensor are described and an algorithm for detecting interest points and estimating feature parameters is presented. It is shown that the algorithm works on a wide spectrum of images with good result. Representations that are invariant with respect to a set of transformations are useful in many applications, such as pose estimation, tracking and wide baseline stereo. The scene tensor itself is not invariant and three different methods for implementing an invariant representation based on the scene tensor is presented. One is based on a non-linear transformation of the scene tensor and is invariant to perspective transformations. Two versions of a tensor doublet is presented, which is based on a geometry of two interest points and is invariant to translation, rotation and scaling. The tensor doublet is used in a framework for view centered pose estimation of 3D objects. It is shown that the pose estimation algorithm has good performance even though the object is occluded and has a different scale compared to the training situation. An industrial implementation of a bin picking application have to cope with several different types of objects. All pose estimation algorithms use some kind of model and there is yet no model that can cope with all kinds of situations and objects. This thesis presents a method for integrating cues from several pose estimation algorithms for increasing the system stability. It is also shown that the same framework can also be used for increasing the accuracy of the system by using cues from several views of the object. An extensive test with several different objects, lighting conditions and backgrounds shows that multi-cue integration makes the system more robust and increases the accuracy. Finally, a system for bin picking is presented, built from the previous parts of this thesis. An eye in hand setup is used with a standard industrial robot arm. It is shown that the system works for real bin-picking situations with a positioning error below 1 mm and an orientation error below 1o degree for most of the different situations. === Report code: LiU-TEK-LIC-2005:15.
author Söderberg, Robert
author_facet Söderberg, Robert
author_sort Söderberg, Robert
title Compact Representations and Multi-cue Integration for Robotics
title_short Compact Representations and Multi-cue Integration for Robotics
title_full Compact Representations and Multi-cue Integration for Robotics
title_fullStr Compact Representations and Multi-cue Integration for Robotics
title_full_unstemmed Compact Representations and Multi-cue Integration for Robotics
title_sort compact representations and multi-cue integration for robotics
publisher Linköpings universitet, Bildbehandling
publishDate 2005
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5574
http://nbn-resolving.de/urn:isbn:91-85299-36-7
work_keys_str_mv AT soderbergrobert compactrepresentationsandmulticueintegrationforrobotics
_version_ 1716510676557496320