Reading between the lines : object localization using implicit cues from image tags
Current uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define t...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/2152/ETD-UT-2010-05-1514 |
id |
ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2010-05-1514 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2010-05-15142015-09-20T16:55:44ZReading between the lines : object localization using implicit cues from image tagsHwang, Sung JuComputer visionObject recognitionObject detectionCurrent uses of tagged images typically exploit only the most explicit information: the link between the nouns named and the objects present somewhere in the image. We propose to leverage “unspoken” cues that rest within an ordered list of image tags so as to improve object localization. We define three novel implicit features from an image’s tags—the relative prominence of each object as signified by its order of mention, the scale constraints implied by unnamed objects, and the loose spatial links hinted by the proximity of names on the list. By learning a conditional density over the localization parameters (position and scale) given these cues, we show how to improve both accuracy and efficiency when detecting the tagged objects. We validate our approach with 25 object categories from the PASCAL VOC and LabelMe datasets, and demonstrate its effectiveness relative to both traditional sliding windows as well as a visual context baseline.text2010-11-10T15:14:19Z2010-11-10T15:14:30Z2010-11-10T15:14:19Z2010-11-10T15:14:30Z2010-052010-11-10May 20102010-11-10T15:14:30Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2010-05-1514eng |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Computer vision Object recognition Object detection |
spellingShingle |
Computer vision Object recognition Object detection Hwang, Sung Ju Reading between the lines : object localization using implicit cues from image tags |
description |
Current uses of tagged images typically exploit only
the most explicit information: the link between the nouns
named and the objects present somewhere in the image. We
propose to leverage “unspoken” cues that rest within an
ordered list of image tags so as to improve object localization.
We define three novel implicit features from an image’s
tags—the relative prominence of each object as signified
by its order of mention, the scale constraints implied
by unnamed objects, and the loose spatial links hinted by
the proximity of names on the list. By learning a conditional
density over the localization parameters (position
and scale) given these cues, we show how to improve both
accuracy and efficiency when detecting the tagged objects.
We validate our approach with 25 object categories from
the PASCAL VOC and LabelMe datasets, and demonstrate
its effectiveness relative to both traditional sliding windows
as well as a visual context baseline. === text |
author |
Hwang, Sung Ju |
author_facet |
Hwang, Sung Ju |
author_sort |
Hwang, Sung Ju |
title |
Reading between the lines : object localization using implicit cues from image tags |
title_short |
Reading between the lines : object localization using implicit cues from image tags |
title_full |
Reading between the lines : object localization using implicit cues from image tags |
title_fullStr |
Reading between the lines : object localization using implicit cues from image tags |
title_full_unstemmed |
Reading between the lines : object localization using implicit cues from image tags |
title_sort |
reading between the lines : object localization using implicit cues from image tags |
publishDate |
2010 |
url |
http://hdl.handle.net/2152/ETD-UT-2010-05-1514 |
work_keys_str_mv |
AT hwangsungju readingbetweenthelinesobjectlocalizationusingimplicitcuesfromimagetags |
_version_ |
1716821064076492800 |