A geometric framework for dynamic vision

This thesis explores the problem of inferring information about the three-dimensional world from its projections onto a camera (images). Among all visual cues, we do not address "pictorial" ones, such as texture or shading. Instead, we concentrate on "dynamic" cues, which are ass...

Bibliographic Details
Main Author:	Soatto, Stefano
Format:	Others
Language:	en
Published:	1996
Online Access:	https://thesis.library.caltech.edu/68/1/Soatto_s_1996.pdf Soatto, Stefano (1996) A geometric framework for dynamic vision. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/x87w-t943. https://resolver.caltech.edu/CaltechETD:etd-01082008-103705

id	ndltd-CALTECH-oai-thesis.library.caltech.edu-68
record_format	oai_dc
spelling	ndltd-CALTECH-oai-thesis.library.caltech.edu-68 2021-04-20T05:01:32Z https://thesis.library.caltech.edu/68/ 1996 Thesis NonPeerReviewed application/pdf en other https://resolver.caltech.edu/CaltechETD:etd-01082008-103705 CaltechETD:etd-01082008-103705 10.7907/x87w-t943
collection	NDLTD
language	en
format	Others
sources	NDLTD
description	This thesis explores the problem of inferring information about the three-dimensional world from its projections onto a camera (images). Among all visual cues, we do not address "pictorial" ones, such as texture or shading. Instead, we concentrate on "dynamic" cues, which are associated with variations of the image over time. In order to eliminate pictorial cues, one may represent the world as a collection of geometric primitives, such as points, curves or surfaces in three-dimensional space. Then, from the two-dimensional motion of the projection of such primitives onto the image, one can infer the three-dimensional structure of the world and its motion relative to the viewer. "Three-dimensional structure from two-dimensional images" has now been a central theme in Computer Vision for over two decades, and tools from Linear Algebra and Projective Geometry have been widely employed to attack the problem as a "static" task. It is only in recent years that the role of time has started to be recognized, after the influential work of Dickmanns and his coworkers on vehicle guidance on freeways. We do not impose restrictions on the structure of the environment, and we cast the problem of general three-dimensional structure and motion estimation within the framework of Dynamical Systems. We show how different algebraic constraints on the image projections can be interpreted as nonlinear and implicit dynamical models whose (unknown) parameters live in peculiar differentiable manifolds that encode three-dimensional information. Recovering such three-dimensional information then amounts to identifying dynamical models while taking into account the geometry of the parameter manifolds.
author	Soatto, Stefano
title	A geometric framework for dynamic vision
