Learning Language-vision Correspondences
Given an unstructured collection of captioned images of cluttered scenes featuring a variety of objects, our goal is to simultaneously learn the names and appearances of the objects. Only a small fraction of local features within any given image are associated with a particular caption word, and cap...
Main Author: | Jamieson, Michael |
---|---|
Other Authors: | Dickinson, Sven |
Language: | en_ca |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/1807/26192 |
Similar Items
-
Learning Language-vision Correspondences
by: Jamieson, Michael
Published: (2010) -
Color based classification of circular markers for the identification of experimental units
by: Narjala, Lakshmi
Published: (2013) -
A Flexible Object-of-Interest Annotation Framework for Online Video Portals
by: Robert Sorschag
Published: (2012-02-01) -
50,000 Tiny Videos: A Large Dataset for Non-parametric Content-based Retrieval and Recognition
by: Karpenko, Alexandre
Published: (2009) -
50,000 Tiny Videos: A Large Dataset for Non-parametric Content-based Retrieval and Recognition
by: Karpenko, Alexandre
Published: (2009)