Recognition of people, objects and places

The recognition approach for people (faces) and objects is general and does not use any contextual information. The algorithm is based on feature points. It has a learning stage which attempts to maximise information theoretic entropy and learns relational information between extracted feature point...

Bibliographic Details
Main Author: Gan, S. Y.
Published: University of Cambridge 2007
Subjects: 621.382
Online Access: http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292
id ndltd-bl.uk-oai-ethos.bl.uk-599292
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-599292; 2015-03-20T06:05:14Z; Recognition of people, objects and places; Gan, S. Y.; 2007; 621.382; University of Cambridge; http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292; Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 621.382
spellingShingle 621.382
Gan, S. Y.
Recognition of people, objects and places
description The recognition approach for people (faces) and objects is general and does not use any contextual information. The algorithm is based on feature points. It has a learning stage which attempts to maximise information-theoretic entropy and learns relational information between extracted feature points. The result of learning is an ordered list of three-dimensional moves in space and scale, termed jumps. These jumps are performed during the runtime recognition stage to differentiate between similar feature points and so achieve recognition. The approach can be aptly described by the following question: “Where do I look next in space and scale to gain the most information about what I am looking at?” Closely related to localisation is navigation. Without localisation, navigation is impossible. The research in this document examines how these two are intertwined. The approach taken to navigation does not require a globally consistent map, which can be computationally expensive and difficult to build. An augmented reality application which is able to guide a human user from one location to another is presented. Key-frames are extracted from a training video sequence which shows the path to the destination. Local reference frames are built from pairs of key-frames, and the scale need not be consistent across local reference frames. Navigation is achieved by solving a series of smaller navigation tasks. At runtime, a live frame is localised to a local reference frame, and navigation proceeds by moving from one local reference frame to the next. An iteratively re-weighted least squares estimator is used for pose estimation, and a Kalman filter is used as a smoothing agent to reduce the effects of jitter and outlying pose estimates. The system runs at about 5 Hz on 640 × 480 frames.
author Gan, S. Y.
author_facet Gan, S. Y.
author_sort Gan, S. Y.
title Recognition of people, objects and places
title_short Recognition of people, objects and places
title_full Recognition of people, objects and places
title_fullStr Recognition of people, objects and places
title_full_unstemmed Recognition of people, objects and places
title_sort recognition of people, objects and places
publisher University of Cambridge
publishDate 2007
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292
work_keys_str_mv AT gansy recognitionofpeopleobjectsandplaces
_version_ 1716795619596566528