Learning Language-vision Correspondences

Given an unstructured collection of captioned images of cluttered scenes featuring a variety of objects, our goal is to simultaneously learn the names and appearances of the objects. Only a small fraction of local features within any given image are associated with a particular caption word, and cap...

Full description

Bibliographic Details
Main Author:	Jamieson, Michael
Other Authors:	Dickinson, Sven
Language:	en_ca
Published:	2010
Subjects:	image annotation object recognition 0984
Online Access:	http://hdl.handle.net/1807/26192

Internet

http://hdl.handle.net/1807/26192

Learning Language-vision Correspondences

Internet

Similar Items