Multiscale conditional random fields for machine vision
We develop a single joint model which can classify images and label super-pixels, based on tree-structured conditional random fields (CRFs) derived from a hierarchical image segmentation, extending previous work by Reynolds and Murphy, and Plath and Toussaint. We show how to train this model in a w...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2010
|
Online Access: | http://hdl.handle.net/2429/27032 |
id |
ndltd-UBC-oai-circle.library.ubc.ca-2429-27032 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UBC-oai-circle.library.ubc.ca-2429-270322018-01-05T17:24:24Z Multiscale conditional random fields for machine vision Duvenaud, David We develop a single joint model which can classify images and label super-pixels, based on tree-structured conditional random fields (CRFs) derived from a hierarchical image segmentation, extending previous work by Reynolds and Murphy, and Plath and Toussaint. We show how to train this model in a weakly-supervised fashion, in which some of the images only have captions specifying which objects are present; this information is propagated down the tree and thus provides weakly labeled data at the leaves, which can be used to improve the performance of the super-pixel classifiers. After training, information can be propagated from the super-pixels up to the root-level image classifier (although this does not seem to help in practice compared to just using root-level features). We compare two kinds of tree: the standard one with pairwise potentials, and one based on noisy-or potentials, which better matches the semantics of the recursive partitioning used to create the tree. However, we do not find any significant difference between the two. Science, Faculty of Computer Science, Department of Graduate 2010-07-30T17:00:22Z 2010-07-30T17:00:22Z 2010 2010-11 Text Thesis/Dissertation http://hdl.handle.net/2429/27032 eng Attribution-NonCommercial 3.0 Unported http://creativecommons.org/licenses/by-nc/3.0/ University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
We develop a single joint model which can classify images and label super-pixels, based on tree-structured conditional random fields (CRFs) derived from a hierarchical image segmentation, extending previous work by Reynolds and Murphy, and Plath and Toussaint. We show how to train this model in a weakly-supervised fashion, in which some of the images only have captions specifying which objects are present; this information is propagated down the tree and thus provides weakly labeled data at the leaves, which can be used to improve the performance of the super-pixel classifiers. After training, information can be propagated from the super-pixels up to the root-level image classifier (although this does not seem to help in practice compared to just using root-level features). We compare two kinds of tree: the standard one with pairwise potentials, and one based on noisy-or potentials, which better matches the semantics of the recursive partitioning used to create the tree. However, we do not find any significant difference between the two. === Science, Faculty of === Computer Science, Department of === Graduate |
author |
Duvenaud, David |
spellingShingle |
Duvenaud, David Multiscale conditional random fields for machine vision |
author_facet |
Duvenaud, David |
author_sort |
Duvenaud, David |
title |
Multiscale conditional random fields for machine vision |
title_short |
Multiscale conditional random fields for machine vision |
title_full |
Multiscale conditional random fields for machine vision |
title_fullStr |
Multiscale conditional random fields for machine vision |
title_full_unstemmed |
Multiscale conditional random fields for machine vision |
title_sort |
multiscale conditional random fields for machine vision |
publisher |
University of British Columbia |
publishDate |
2010 |
url |
http://hdl.handle.net/2429/27032 |
work_keys_str_mv |
AT duvenauddavid multiscaleconditionalrandomfieldsformachinevision |
_version_ |
1718582538696392704 |