Reading between the lines : object localization using implicit cues from image tags

Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define t...

Full description

Bibliographic Details
Main Author:	Hwang, Sung Ju
Format:	Others
Language:	English
Published:	2010
Subjects:	Computer vision Object recognition Object detection
Online Access:	http://hdl.handle.net/2152/ETD-UT-2010-05-1514

id	ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2010-05-1514
record_format	oai_dc
spelling	ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2010-05-15142015-09-20T16:55:44ZReading between the lines : object localization using implicit cues from image tagsHwang, Sung JuComputer visionObject recognitionObject detectionCurrent uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image’s tags—the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline.text2010-11-10T15:14:19Z2010-11-10T15:14:30Z2010-11-10T15:14:19Z2010-11-10T15:14:30Z2010-052010-11-10May 20102010-11-10T15:14:30Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2010-05-1514eng
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Computer vision Object recognition Object detection
spellingShingle	Computer vision Object recognition Object detection Hwang, Sung Ju Reading between the lines : object localization using implicit cues from image tags
description	Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image’s tags—the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline. === text
author	Hwang, Sung Ju
author_facet	Hwang, Sung Ju
author_sort	Hwang, Sung Ju
title	Reading between the lines : object localization using implicit cues from image tags
title_short	Reading between the lines : object localization using implicit cues from image tags
title_full	Reading between the lines : object localization using implicit cues from image tags
title_fullStr	Reading between the lines : object localization using implicit cues from image tags
title_full_unstemmed	Reading between the lines : object localization using implicit cues from image tags
title_sort	reading between the lines : object localization using implicit cues from image tags
publishDate	2010
url	http://hdl.handle.net/2152/ETD-UT-2010-05-1514
work_keys_str_mv	AT hwangsungju readingbetweenthelinesobjectlocalizationusingimplicitcuesfromimagetags
_version_	1716821064076492800

Reading between the lines : object localization using implicit cues from image tags

Similar Items