Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.

In forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the...

Full description

Bibliographic Details
Main Authors: Javier Franco-Pedroso, Daniel Ramos, Joaquin Gonzalez-Rodriguez
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4762660?pdf=render
id doaj-d978aed83f35481289770edf9b146da9
record_format Article
spelling doaj-d978aed83f35481289770edf9b146da92020-11-24T20:50:51ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-01112e014995810.1371/journal.pone.0149958Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.Javier Franco-PedrosoDaniel RamosJoaquin Gonzalez-RodriguezIn forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the likelihood ratio framework is being increasingly adopted. Several methods have been derived in order to obtain likelihood ratios directly from univariate or multivariate data by modelling both the variation appearing between observations (or features) coming from the same source (within-source variation) and that appearing between observations coming from different sources (between-source variation). In the widely used multivariate kernel likelihood-ratio, the within-source distribution is assumed to be normally distributed and constant among different sources and the between-source variation is modelled through a kernel density function (KDF). In order to better fit the observed distribution of the between-source variation, this paper presents a different approach in which a Gaussian mixture model (GMM) is used instead of a KDF. As it will be shown, this approach provides better-calibrated likelihood ratios as measured by the log-likelihood ratio cost (Cllr) in experiments performed on freely available forensic datasets involving different trace evidences: inks, glass fragments and car paints.http://europepmc.org/articles/PMC4762660?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Javier Franco-Pedroso
Daniel Ramos
Joaquin Gonzalez-Rodriguez
spellingShingle Javier Franco-Pedroso
Daniel Ramos
Joaquin Gonzalez-Rodriguez
Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
PLoS ONE
author_facet Javier Franco-Pedroso
Daniel Ramos
Joaquin Gonzalez-Rodriguez
author_sort Javier Franco-Pedroso
title Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
title_short Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
title_full Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
title_fullStr Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
title_full_unstemmed Gaussian Mixture Models of Between-Source Variation for Likelihood Ratio Computation from Multivariate Data.
title_sort gaussian mixture models of between-source variation for likelihood ratio computation from multivariate data.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2016-01-01
description In forensic science, trace evidence found at a crime scene and on suspect has to be evaluated from the measurements performed on them, usually in the form of multivariate data (for example, several chemical compound or physical characteristics). In order to assess the strength of that evidence, the likelihood ratio framework is being increasingly adopted. Several methods have been derived in order to obtain likelihood ratios directly from univariate or multivariate data by modelling both the variation appearing between observations (or features) coming from the same source (within-source variation) and that appearing between observations coming from different sources (between-source variation). In the widely used multivariate kernel likelihood-ratio, the within-source distribution is assumed to be normally distributed and constant among different sources and the between-source variation is modelled through a kernel density function (KDF). In order to better fit the observed distribution of the between-source variation, this paper presents a different approach in which a Gaussian mixture model (GMM) is used instead of a KDF. As it will be shown, this approach provides better-calibrated likelihood ratios as measured by the log-likelihood ratio cost (Cllr) in experiments performed on freely available forensic datasets involving different trace evidences: inks, glass fragments and car paints.
url http://europepmc.org/articles/PMC4762660?pdf=render
work_keys_str_mv AT javierfrancopedroso gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata
AT danielramos gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata
AT joaquingonzalezrodriguez gaussianmixturemodelsofbetweensourcevariationforlikelihoodratiocomputationfrommultivariatedata
_version_ 1716803367862272000