Recognition of people, objects and places

The recognition approach for people (faces) and objects is general and does not use any contextual information. The algorithm is based on feature points. It has a learning stage which attempts to maximise information theoretic entropy and learns relational information between extracted feature point...

Full description

Bibliographic Details
Main Author:	Gan, S. Y.
Published:	University of Cambridge 2007
Subjects:	621.382
Online Access:	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292

id	ndltd-bl.uk-oai-ethos.bl.uk-599292
record_format	oai_dc
spelling	ndltd-bl.uk-oai-ethos.bl.uk-5992922015-03-20T06:05:14ZRecognition of people, objects and placesGan, S. Y.2007The recognition approach for people (faces) and objects is general and does not use any contextual information. The algorithm is based on feature points. It has a learning stage which attempts to maximise information theoretic entropy and learns relational information between extracted feature points. The result of learning is an ordered list of three-dimensional moves in space and scale, termed <i>jumps. </i>These jumps are performed during the runtime recognition stage to differentiate between similar feature points to achieve recognition. The approach can be aptly described by the following question, “Where do I look next in space and scale to gain the most information about what I am looking at?” Closely related to localisation is navigation. Without localisation, navigation is impossible. The research in this document examines how these two are intertwined. The approach taken to tackle navigation is one which does not require the use of a globally consistent map. This removes the need to build such a map which can be computational expensive and difficult to build. An augmented reality application which is able to guide a human user from one location to another is presented. Key-frames are extracted from a training video sequence which shows the path to the destination. Local reference frames are built from pairs of key-frames and there is no necessity to have a consistent scale across local reference frames. Navigation is achieved by solving a series of smaller navigation tasks. At runtime, a live frame is localised to a local reference frame and navigation is achieved by moving from one local reference frame to the next. An iterative re-weighted least squares estimator is used for pose estimation and a Kalman filter is used as a smoothing agent to reduce the effects of jitter and outlying pose estimates. The system runs at about 5Hz on 640 x 480 frames.621.382University of Cambridgehttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292Electronic Thesis or Dissertation
collection	NDLTD
sources	NDLTD
topic	621.382
spellingShingle	621.382 Gan, S. Y. Recognition of people, objects and places
description	The recognition approach for people (faces) and objects is general and does not use any contextual information. The algorithm is based on feature points. It has a learning stage which attempts to maximise information theoretic entropy and learns relational information between extracted feature points. The result of learning is an ordered list of three-dimensional moves in space and scale, termed <i>jumps. </i>These jumps are performed during the runtime recognition stage to differentiate between similar feature points to achieve recognition. The approach can be aptly described by the following question, “Where do I look next in space and scale to gain the most information about what I am looking at?” Closely related to localisation is navigation. Without localisation, navigation is impossible. The research in this document examines how these two are intertwined. The approach taken to tackle navigation is one which does not require the use of a globally consistent map. This removes the need to build such a map which can be computational expensive and difficult to build. An augmented reality application which is able to guide a human user from one location to another is presented. Key-frames are extracted from a training video sequence which shows the path to the destination. Local reference frames are built from pairs of key-frames and there is no necessity to have a consistent scale across local reference frames. Navigation is achieved by solving a series of smaller navigation tasks. At runtime, a live frame is localised to a local reference frame and navigation is achieved by moving from one local reference frame to the next. An iterative re-weighted least squares estimator is used for pose estimation and a Kalman filter is used as a smoothing agent to reduce the effects of jitter and outlying pose estimates. The system runs at about 5Hz on 640 x 480 frames.
author	Gan, S. Y.
author_facet	Gan, S. Y.
author_sort	Gan, S. Y.
title	Recognition of people, objects and places
title_short	Recognition of people, objects and places
title_full	Recognition of people, objects and places
title_fullStr	Recognition of people, objects and places
title_full_unstemmed	Recognition of people, objects and places
title_sort	recognition of people, objects and places
publisher	University of Cambridge
publishDate	2007
url	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.599292
work_keys_str_mv	AT gansy recognitionofpeopleobjectsandplaces
_version_	1716795619596566528

Recognition of people, objects and places

Similar Items