A geometric framework for dynamic vision

This thesis explores the problem of inferring information about the three-dimensional world from its projections onto a camera (images). Among all visual cues, we do not address "pictorial" ones, such as texture or shading. Instead, we concentrate on "dynamic" cues, which are ass...

Full description

Bibliographic Details
Main Author: Soatto, Stefano
Format: Others
Language:en
Published: 1996
Online Access:https://thesis.library.caltech.edu/68/1/Soatto_s_1996.pdf
Soatto, Stefano (1996) A geometric framework for dynamic vision. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/x87w-t943. https://resolver.caltech.edu/CaltechETD:etd-01082008-103705 <https://resolver.caltech.edu/CaltechETD:etd-01082008-103705>
id ndltd-CALTECH-oai-thesis.library.caltech.edu-68
record_format oai_dc
spelling ndltd-CALTECH-oai-thesis.library.caltech.edu-682021-04-20T05:01:32Z https://thesis.library.caltech.edu/68/ A geometric framework for dynamic vision Soatto, Stefano This thesis explores the problem of inferring information about the three-dimensional world from its projections onto a camera (images). Among all visual cues, we do not address "pictorial" ones, such as texture or shading. Instead, we concentrate on "dynamic" cues, which are associated with variations of the image over time. In order to eliminate pictorial cues, one may represent the world as a collection of geometric primitives, such as points, curves or surfaces in three-dimensional space. Then, from the two-dimensional motion of the projection of such primitives onto the image, one can infer the three-dimensional structure of the world and its motion relative to the viewer. "Three-dimensional structure from two-dimensional images" has now been a central theme in Computer Vision for over two decades, and tools from Linear Algebra and Projective Geometry have been widely employed to attack the problem as a "static" task. It is only in recent years that the role of time has started to be recognized, after the influential work of Dickmanns and his coworkers on vehicle guidance on freeways. We do not impose restrictions on the structure of the environment, and we cast the problem of general three-dimensional structure and motion estimation within the framework of Dynamical Systems. We show how different algebraic constraints on the image projections can be interpreted as nonlinear and implicit dynamical models whose (unknown) parameters live in peculiar differentiable manifolds that encode three-dimensional information. Recovering such three-dimensional information then amounts to identifying dynamical models while taking into account the geometry of the parameter manifolds. 1996 Thesis NonPeerReviewed application/pdf en other https://thesis.library.caltech.edu/68/1/Soatto_s_1996.pdf Soatto, Stefano (1996) A geometric framework for dynamic vision. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/x87w-t943. https://resolver.caltech.edu/CaltechETD:etd-01082008-103705 <https://resolver.caltech.edu/CaltechETD:etd-01082008-103705> https://resolver.caltech.edu/CaltechETD:etd-01082008-103705 CaltechETD:etd-01082008-103705 10.7907/x87w-t943
collection NDLTD
language en
format Others
sources NDLTD
description This thesis explores the problem of inferring information about the three-dimensional world from its projections onto a camera (images). Among all visual cues, we do not address "pictorial" ones, such as texture or shading. Instead, we concentrate on "dynamic" cues, which are associated with variations of the image over time. In order to eliminate pictorial cues, one may represent the world as a collection of geometric primitives, such as points, curves or surfaces in three-dimensional space. Then, from the two-dimensional motion of the projection of such primitives onto the image, one can infer the three-dimensional structure of the world and its motion relative to the viewer. "Three-dimensional structure from two-dimensional images" has now been a central theme in Computer Vision for over two decades, and tools from Linear Algebra and Projective Geometry have been widely employed to attack the problem as a "static" task. It is only in recent years that the role of time has started to be recognized, after the influential work of Dickmanns and his coworkers on vehicle guidance on freeways. We do not impose restrictions on the structure of the environment, and we cast the problem of general three-dimensional structure and motion estimation within the framework of Dynamical Systems. We show how different algebraic constraints on the image projections can be interpreted as nonlinear and implicit dynamical models whose (unknown) parameters live in peculiar differentiable manifolds that encode three-dimensional information. Recovering such three-dimensional information then amounts to identifying dynamical models while taking into account the geometry of the parameter manifolds.
author Soatto, Stefano
spellingShingle Soatto, Stefano
A geometric framework for dynamic vision
author_facet Soatto, Stefano
author_sort Soatto, Stefano
title A geometric framework for dynamic vision
title_short A geometric framework for dynamic vision
title_full A geometric framework for dynamic vision
title_fullStr A geometric framework for dynamic vision
title_full_unstemmed A geometric framework for dynamic vision
title_sort geometric framework for dynamic vision
publishDate 1996
url https://thesis.library.caltech.edu/68/1/Soatto_s_1996.pdf
Soatto, Stefano (1996) A geometric framework for dynamic vision. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/x87w-t943. https://resolver.caltech.edu/CaltechETD:etd-01082008-103705 <https://resolver.caltech.edu/CaltechETD:etd-01082008-103705>
work_keys_str_mv AT soattostefano ageometricframeworkfordynamicvision
AT soattostefano geometricframeworkfordynamicvision
_version_ 1719397368190730240