Modeling and Predicting Object Attention in Natural Scenes

<p>Humans automatically attend to certain objects in a scene. Better understanding this process could improve a computer's ability to parse scene images and convey information about them to humans. This thesis is arranged in three parts. The first part explores how important a particul...

Full description

Bibliographic Details
Main Author: Spain, Merrielle Therese
Format: Others
Published: 2011
Online Access:https://thesis.library.caltech.edu/6459/2/spain_thesis.pdf
Spain, Merrielle Therese (2011) Modeling and Predicting Object Attention in Natural Scenes. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/JTEE-7367. https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472 <https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472>
id ndltd-CALTECH-oai-thesis.library.caltech.edu-6459
record_format oai_dc
spelling ndltd-CALTECH-oai-thesis.library.caltech.edu-64592019-10-10T03:02:40Z Modeling and Predicting Object Attention in Natural Scenes Spain, Merrielle Therese <p>Humans automatically attend to certain objects in a scene. Better understanding this process could improve a computer's ability to parse scene images and convey information about them to humans. This thesis is arranged in three parts. The first part explores how important a particular object is in a photograph of a complex scene. We propose a definition of importance and present two methods for measuring object importance from human observers. Using this ground truth, we fit a function for predicting the importance of each object directly from a segmented image; our function combines many object-related and image-related features. We validate our importance predictions on a large set of objects and find that the most important objects may be identified automatically. We find that object position and size are particularly informative, while a popular measure of saliency is not.</p> <p>The second part explores the relationship between object naming, eye movements, and saliency maps. Eye movements correlate with shifts in attention and are thought to be a consequence of optimal resource allocation for high-level tasks such as visual recognition. Saliency maps, are often built on the assumption that "early" features (e.g., color, contrast, orientation, and motion) as opposed to objects themselves drive attention. We measure the eye position of humans viewing scenes and then ask them to recall objects that they saw in each scene. Weighted with recall frequency or maximum saliency, these objects predict fixations in individual images better than early saliency, suggesting that early saliency may have an indirect effect on attention, acting through detected objects.</p> <p>The third part explores the problem of locating objects in a scene irrespective of category. We introduce the first benchmark for category-independent object detection. It is composed of a large public dataset of annotated high-resolution scene images and suitable metrics for performance evaluation. We demonstrate our benchmark by comparing three methods for generalized object detection against a baseline and an upper bound.</p> 2011 Thesis NonPeerReviewed application/pdf https://thesis.library.caltech.edu/6459/2/spain_thesis.pdf https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472 Spain, Merrielle Therese (2011) Modeling and Predicting Object Attention in Natural Scenes. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/JTEE-7367. https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472 <https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472> https://thesis.library.caltech.edu/6459/
collection NDLTD
format Others
sources NDLTD
description <p>Humans automatically attend to certain objects in a scene. Better understanding this process could improve a computer's ability to parse scene images and convey information about them to humans. This thesis is arranged in three parts. The first part explores how important a particular object is in a photograph of a complex scene. We propose a definition of importance and present two methods for measuring object importance from human observers. Using this ground truth, we fit a function for predicting the importance of each object directly from a segmented image; our function combines many object-related and image-related features. We validate our importance predictions on a large set of objects and find that the most important objects may be identified automatically. We find that object position and size are particularly informative, while a popular measure of saliency is not.</p> <p>The second part explores the relationship between object naming, eye movements, and saliency maps. Eye movements correlate with shifts in attention and are thought to be a consequence of optimal resource allocation for high-level tasks such as visual recognition. Saliency maps, are often built on the assumption that "early" features (e.g., color, contrast, orientation, and motion) as opposed to objects themselves drive attention. We measure the eye position of humans viewing scenes and then ask them to recall objects that they saw in each scene. Weighted with recall frequency or maximum saliency, these objects predict fixations in individual images better than early saliency, suggesting that early saliency may have an indirect effect on attention, acting through detected objects.</p> <p>The third part explores the problem of locating objects in a scene irrespective of category. We introduce the first benchmark for category-independent object detection. It is composed of a large public dataset of annotated high-resolution scene images and suitable metrics for performance evaluation. We demonstrate our benchmark by comparing three methods for generalized object detection against a baseline and an upper bound.</p>
author Spain, Merrielle Therese
spellingShingle Spain, Merrielle Therese
Modeling and Predicting Object Attention in Natural Scenes
author_facet Spain, Merrielle Therese
author_sort Spain, Merrielle Therese
title Modeling and Predicting Object Attention in Natural Scenes
title_short Modeling and Predicting Object Attention in Natural Scenes
title_full Modeling and Predicting Object Attention in Natural Scenes
title_fullStr Modeling and Predicting Object Attention in Natural Scenes
title_full_unstemmed Modeling and Predicting Object Attention in Natural Scenes
title_sort modeling and predicting object attention in natural scenes
publishDate 2011
url https://thesis.library.caltech.edu/6459/2/spain_thesis.pdf
Spain, Merrielle Therese (2011) Modeling and Predicting Object Attention in Natural Scenes. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/JTEE-7367. https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472 <https://resolver.caltech.edu/CaltechTHESIS:05262011-172742472>
work_keys_str_mv AT spainmerrielletherese modelingandpredictingobjectattentioninnaturalscenes
_version_ 1719263313561387008