Shape Based Recognition – Cognitive Vision Systems in Traffic Safety Applications

Traffic accidents are globally the number one cause of death for people 15-29 years old and are among the top three causes for all age groups 5-44 years. Much of the work within this thesis has been carried out in projects aiming for (cognitive) driver assistance systems and hopefully represents a st...

Bibliographic Details
Main Author: Larsson, Fredrik
Format: Doctoral Thesis
Language: English
Published: Linköpings universitet, Datorseende 2011
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-71664
http://nbn-resolving.de/urn:isbn:978-91-7393-074-1
id ndltd-UPSALLA1-oai-DiVA.org-liu-71664
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-liu-71664 2016-05-05T05:12:17Z Shape Based Recognition – Cognitive Vision Systems in Traffic Safety Applications eng Larsson, Fredrik Linköpings universitet, Datorseende Linköpings universitet, Tekniska högskolan Linköping : Linköping University Electronic Press 2011 Doctoral thesis, comprehensive summary info:eu-repo/semantics/doctoralThesis text http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-71664 urn:isbn:978-91-7393-074-1 Linköping Studies in Science and Technology. Dissertations, 0345-7524 ; 1395 application/pdf info:eu-repo/semantics/openAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
description Traffic accidents are globally the number one cause of death for people 15-29 years old and are among the top three causes for all age groups 5-44 years. Much of the work within this thesis has been carried out in projects aiming for (cognitive) driver assistance systems and hopefully represents a step towards improving traffic safety. The main contributions are within the area of computer vision, more specifically within shape matching, Bayesian tracking, and visual servoing, with the main focus on shape matching and its applications. The different methods have been demonstrated in traffic safety applications, such as bicycle tracking, car tracking, and traffic sign recognition, as well as for pose estimation and robot control. One of the core contributions is a new method for recognizing closed contours, based on complex correlation of Fourier descriptors. It is shown that keeping the phase of Fourier descriptors is important: neglecting the phase can result in perfect matches between intrinsically different shapes. Another benefit of keeping the phase is that rotation-covariant and rotation-invariant matching are achieved in the same way. The only difference is whether one considers the magnitude of the complex-valued correlation, for rotation-invariant matching, or just its real part, for rotation-covariant matching. The shape matching method has further been used in combination with an implicit star-shaped object model for traffic sign recognition. The presented method works fully automatically on query images, with no need for regions of interest. It is shown that the presented method performs well for traffic signs that contain multiple distinct contours, while some improvement is still needed for signs defined by a single contour. The presented methodology is general enough to be used for arbitrary objects, as long as they can be defined by a number of regions.
Another contribution has been the extension of a framework for learning-based Bayesian tracking called channel-based tracking. Compared to earlier work, the multi-dimensional case has been reformulated in a sound probabilistic way and the learning algorithm itself has been extended. The framework is evaluated in car tracking scenarios and is shown to give tracking performance competitive with standard approaches, with the added advantage of being fully learnable. The last contribution has been in the field of (cognitive) robot control. The presented method achieves sufficient accuracy for simple assembly tasks by combining autonomous recognition with visual servoing, based on a learned mapping between percepts and actions. The method demonstrates that limitations of inexpensive hardware, such as web cameras and low-cost robotic arms, can be overcome using powerful algorithms. All in all, the methods developed and presented in this thesis can be used for different components in a system guided by visual information, and hopefully represent a step towards improving traffic safety.
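The description's remark about magnitude versus real part of the complex correlation can be illustrated with a small sketch. This is not the thesis's implementation: the function names, the choice of normalization (dropping the DC term and scaling to unit energy), and the test contour are all assumptions made for illustration. A closed contour is represented as complex samples x + iy, its Fourier descriptor is the FFT of those samples, and the correlation over all starting-point shifts is obtained with an inverse FFT.

```python
import numpy as np

def fourier_descriptor(contour):
    """Hypothetical helper: FFT of a closed contour given as complex samples x + 1j*y."""
    fd = np.fft.fft(np.asarray(contour, dtype=complex))
    fd[0] = 0.0                      # drop the DC term (assumed translation normalization)
    return fd / np.linalg.norm(fd)   # unit energy (assumed scale normalization)

def match_scores(fd_a, fd_b):
    """Return (rotation-invariant, rotation-covariant) scores for two descriptors."""
    n = len(fd_a)
    # Complex correlation evaluated for every starting-point shift along the contour.
    corr = n * np.fft.ifft(np.conj(fd_a) * fd_b)
    invariant = np.abs(corr).max()   # magnitude: a global rotation only changes the phase
    covariant = corr.real.max()      # real part: penalizes a rotation between the shapes
    return invariant, covariant

# Illustrative data: an ellipse, and the same ellipse rotated by 1 radian
# with its starting point shifted by 5 samples.
t = np.linspace(0.0, 2.0 * np.pi, 64, endpoint=False)
ellipse = 2.0 * np.cos(t) + 1j * np.sin(t)
rotated = np.roll(ellipse, 5) * np.exp(1j * 1.0)

fd_ellipse = fourier_descriptor(ellipse)
fd_rotated = fourier_descriptor(rotated)
inv, cov = match_scores(fd_ellipse, fd_rotated)
self_inv, self_cov = match_scores(fd_ellipse, fd_ellipse)
print(inv, cov, self_inv, self_cov)
```

In this sketch the magnitude-based score treats the rotated ellipse as a perfect match, while the real-part score drops below the self-match value because the rotation shows up as a phase factor in the correlation.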
author Larsson, Fredrik
title Shape Based Recognition – Cognitive Vision Systems in Traffic Safety Applications
publisher Linköpings universitet, Datorseende
publishDate 2011
url http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-71664
http://nbn-resolving.de/urn:isbn:978-91-7393-074-1