Learning Genetic Population Structures Using Minimization of Stochastic Complexity
Considerable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework fo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2010-05-01
|
Series: | Entropy |
Subjects: | |
Online Access: | http://www.mdpi.com/1099-4300/12/5/1102/ |
id |
doaj-a3b926682f954850a10a96be8c5da5d6 |
---|---|
record_format |
Article |
spelling |
doaj-a3b926682f954850a10a96be8c5da5d62020-11-24T21:30:07ZengMDPI AGEntropy1099-43002010-05-011251102112410.3390/e12051102Learning Genetic Population Structures Using Minimization of Stochastic ComplexityJukka CoranderMats GyllenbergTimo KoskiConsiderable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework for learning genetic population structure of a haploid organism from bi-allelic markers for which potential patterns of dependence are a priori unknown and to be explicitly incorporated in the model. Our framework is based on the principle of minimizing stochastic complexity of an unsupervised classification under tree augmented factorization of the predictive data distribution. We discuss a fast implementation of the learning framework using deterministic algorithms. http://www.mdpi.com/1099-4300/12/5/1102/factorization of multivariate distributionsfinite mixture modelsMinimum Description Lengthpopulation geneticsstatistical learningstructured population |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jukka Corander Mats Gyllenberg Timo Koski |
spellingShingle |
Jukka Corander Mats Gyllenberg Timo Koski Learning Genetic Population Structures Using Minimization of Stochastic Complexity Entropy factorization of multivariate distributions finite mixture models Minimum Description Length population genetics statistical learning structured population |
author_facet |
Jukka Corander Mats Gyllenberg Timo Koski |
author_sort |
Jukka Corander |
title |
Learning Genetic Population Structures Using Minimization of Stochastic Complexity |
title_short |
Learning Genetic Population Structures Using Minimization of Stochastic Complexity |
title_full |
Learning Genetic Population Structures Using Minimization of Stochastic Complexity |
title_fullStr |
Learning Genetic Population Structures Using Minimization of Stochastic Complexity |
title_full_unstemmed |
Learning Genetic Population Structures Using Minimization of Stochastic Complexity |
title_sort |
learning genetic population structures using minimization of stochastic complexity |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2010-05-01 |
description |
Considerable research efforts have been devoted to probabilistic modeling of genetic population structures within the past decade. In particular, a wide spectrum of Bayesian models have been proposed for unlinked molecular marker data from diploid organisms. Here we derive a theoretical framework for learning genetic population structure of a haploid organism from bi-allelic markers for which potential patterns of dependence are a priori unknown and to be explicitly incorporated in the model. Our framework is based on the principle of minimizing stochastic complexity of an unsupervised classification under tree augmented factorization of the predictive data distribution. We discuss a fast implementation of the learning framework using deterministic algorithms. |
topic |
factorization of multivariate distributions finite mixture models Minimum Description Length population genetics statistical learning structured population |
url |
http://www.mdpi.com/1099-4300/12/5/1102/ |
work_keys_str_mv |
AT jukkacorander learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity AT matsgyllenberg learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity AT timokoski learninggeneticpopulationstructuresusingminimizationofstochasticcomplexity |
_version_ |
1725963800785453056 |