Facial feature localization using highly flexible yet sufficiently strict shape models

Accurate and efficient localization of facial features is a crucial first step in many face-related computer vision tasks. Some of these tasks include, but not limited to: identity recognition, expression recognition, and head-pose estimation. Most effort in the field has been exerted towards devel...

Full description

Bibliographic Details
Main Author: Tamersoy, Birgi
Format: Others
Language:en
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/2152/25995
id ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-25995
record_format oai_dc
spelling ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-259952015-09-20T17:25:30ZFacial feature localization using highly flexible yet sufficiently strict shape modelsTamersoy, BirgiFacial feature localizationProbabilistic graphical modelsBelief propagationNonparametric belief propagationAccurate and efficient localization of facial features is a crucial first step in many face-related computer vision tasks. Some of these tasks include, but not limited to: identity recognition, expression recognition, and head-pose estimation. Most effort in the field has been exerted towards developing better ways of modeling prior appearance knowledge and image observations. Modeling prior shape knowledge, on the other hand, has not been explored as much. In this dissertation I primarily focus on the limitations of the existing methods in terms of modeling the prior shape knowledge. I first introduce a new pose-constrained shape model. I describe my shape model as being "highly flexible yet sufficiently strict". Existing pose-constrained shape models are either too strict, and have questionable generalization power, or they are too loose, and have questionable localization accuracies. My model tries to find a good middle-ground by learning which shape constraints are more "informative" and should be kept, and which ones are not-so-important and may be omitted. I build my pose-constrained facial feature localization approach on this new shape model using a probabilistic graphical model framework. Within this framework, observed and unobserved variables are defined as the local image observations, and the feature locations, respectively. Feature localization, or "probabilistic inference", is then achieved by nonparametric belief propagation. I show that this approach outperforms other popular pose-constrained methods through qualitative and quantitative experiments. Next, I expand my pose-constrained localization approach to unconstrained setting using a multi-model strategy. While doing so, once again I identify and address the two key limitations of existing multi-model methods: 1) semantically and manually defining the models or "guiding" their generation, and 2) not having efficient and effective model selection strategies. First, I introduce an approach based on unsupervised clustering where the models are automatically learned from training data. Then, I complement this approach with an efficient and effective model selection strategy, which is based on a multi-class naive Bayesian classifier. This way, my method can have many more models, each with a higher level of expressive power, and consequently, provides a more effective partitioning of the face image space. This approach is validated through extensive experiments and comparisons with state-of-the-art methods on state-of-the-art datasets. In the last part of this dissertation I discuss a particular application of the previously introduced techniques; facial feature localization in unconstrained videos. I improve the frame-by-frame localization results, by estimating the actual head-movement from a sequence of noisy head-pose estimates, and then using this information for detecting and fixing the localization failures.text2014-09-18T18:25:42Z2014-082014-07-01August 20142014-09-18T18:25:42ZThesisapplication/pdfhttp://hdl.handle.net/2152/25995en
collection NDLTD
language en
format Others
sources NDLTD
topic Facial feature localization
Probabilistic graphical models
Belief propagation
Nonparametric belief propagation
spellingShingle Facial feature localization
Probabilistic graphical models
Belief propagation
Nonparametric belief propagation
Tamersoy, Birgi
Facial feature localization using highly flexible yet sufficiently strict shape models
description Accurate and efficient localization of facial features is a crucial first step in many face-related computer vision tasks. Some of these tasks include, but not limited to: identity recognition, expression recognition, and head-pose estimation. Most effort in the field has been exerted towards developing better ways of modeling prior appearance knowledge and image observations. Modeling prior shape knowledge, on the other hand, has not been explored as much. In this dissertation I primarily focus on the limitations of the existing methods in terms of modeling the prior shape knowledge. I first introduce a new pose-constrained shape model. I describe my shape model as being "highly flexible yet sufficiently strict". Existing pose-constrained shape models are either too strict, and have questionable generalization power, or they are too loose, and have questionable localization accuracies. My model tries to find a good middle-ground by learning which shape constraints are more "informative" and should be kept, and which ones are not-so-important and may be omitted. I build my pose-constrained facial feature localization approach on this new shape model using a probabilistic graphical model framework. Within this framework, observed and unobserved variables are defined as the local image observations, and the feature locations, respectively. Feature localization, or "probabilistic inference", is then achieved by nonparametric belief propagation. I show that this approach outperforms other popular pose-constrained methods through qualitative and quantitative experiments. Next, I expand my pose-constrained localization approach to unconstrained setting using a multi-model strategy. While doing so, once again I identify and address the two key limitations of existing multi-model methods: 1) semantically and manually defining the models or "guiding" their generation, and 2) not having efficient and effective model selection strategies. First, I introduce an approach based on unsupervised clustering where the models are automatically learned from training data. Then, I complement this approach with an efficient and effective model selection strategy, which is based on a multi-class naive Bayesian classifier. This way, my method can have many more models, each with a higher level of expressive power, and consequently, provides a more effective partitioning of the face image space. This approach is validated through extensive experiments and comparisons with state-of-the-art methods on state-of-the-art datasets. In the last part of this dissertation I discuss a particular application of the previously introduced techniques; facial feature localization in unconstrained videos. I improve the frame-by-frame localization results, by estimating the actual head-movement from a sequence of noisy head-pose estimates, and then using this information for detecting and fixing the localization failures. === text
author Tamersoy, Birgi
author_facet Tamersoy, Birgi
author_sort Tamersoy, Birgi
title Facial feature localization using highly flexible yet sufficiently strict shape models
title_short Facial feature localization using highly flexible yet sufficiently strict shape models
title_full Facial feature localization using highly flexible yet sufficiently strict shape models
title_fullStr Facial feature localization using highly flexible yet sufficiently strict shape models
title_full_unstemmed Facial feature localization using highly flexible yet sufficiently strict shape models
title_sort facial feature localization using highly flexible yet sufficiently strict shape models
publishDate 2014
url http://hdl.handle.net/2152/25995
work_keys_str_mv AT tamersoybirgi facialfeaturelocalizationusinghighlyflexibleyetsufficientlystrictshapemodels
_version_ 1716823932501229568