Learning Genetic Population Structures Using Minimization of Stochastic Complexity

Considerable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework fo...

Full description

Bibliographic Details
Main Authors: Jukka Corander, Mats Gyllenberg, Timo Koski
Format: Article
Language:English
Published: MDPI AG 2010-05-01
Series:Entropy
Subjects:
Online Access:http://www.mdpi.com/1099-4300/12/5/1102/
id doaj-a3b926682f954850a10a96be8c5da5d6
record_format Article
spelling doaj-a3b926682f954850a10a96be8c5da5d62020-11-24T21:30:07ZengMDPI AGEntropy1099-43002010-05-011251102112410.3390/e12051102Learning Genetic Population Structures Using Minimization of Stochastic ComplexityJukka CoranderMats GyllenbergTimo KoskiConsiderable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework for learning genetic population structure of a haploid organism from bi-allelic markers for which potential patterns of dependence are a priori unknown and to be explicitly incorporated in the model. Our framework is based on the principle of minimizing stochastic complexity of an unsupervised classification under tree augmented factorization of the predictive data distribution. We discuss a fast implementation of the learning framework using deterministic algorithms. http://www.mdpi.com/1099-4300/12/5/1102/factorization of multivariate distributionsfinite mixture modelsMinimum Description Lengthpopulation geneticsstatistical learningstructured population
collection DOAJ
language English
format Article
sources DOAJ
author Jukka Corander
Mats Gyllenberg
Timo Koski
spellingShingle Jukka Corander
Mats Gyllenberg
Timo Koski
Learning Genetic Population Structures Using Minimization of Stochastic Complexity
Entropy
factorization of multivariate distributions
finite mixture models
Minimum Description Length
population genetics
statistical learning
structured population
author_facet Jukka Corander
Mats Gyllenberg
Timo Koski
author_sort Jukka Corander
title Learning Genetic Population Structures Using Minimization of Stochastic Complexity
title_short Learning Genetic Population Structures Using Minimization of Stochastic Complexity
title_full Learning Genetic Population Structures Using Minimization of Stochastic Complexity
title_fullStr Learning Genetic Population Structures Using Minimization of Stochastic Complexity
title_full_unstemmed Learning Genetic Population Structures Using Minimization of Stochastic Complexity
title_sort learning genetic population structures using minimization of stochastic complexity
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2010-05-01
description Considerable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework for learning genetic population structure of a haploid organism from bi-allelic markers for which potential patterns of dependence are a priori unknown and to be explicitly incorporated in the model. Our framework is based on the principle of minimizing stochastic complexity of an unsupervised classification under tree augmented factorization of the predictive data distribution. We discuss a fast implementation of the learning framework using deterministic algorithms.
topic factorization of multivariate distributions
finite mixture models
Minimum Description Length
population genetics
statistical learning
structured population
url http://www.mdpi.com/1099-4300/12/5/1102/
work_keys_str_mv AT jukkacorander learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity
AT matsgyllenberg learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity
AT timokoski learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity
_version_ 1725963800785453056