Minimizing recombinations in consensus networks for phylogeographic studies

<p>Abstract</p> <p>Background</p> <p>We address the problem of studying recombinational variations in (human) populations. In this paper, our focus is on one computational aspect of the general task: Given two networks <it>G</it><sub>1 </sub>and...

Full description

Bibliographic Details
Main Authors: Melé Marta, Javed Asif, Parida Laxmi, Calafell Francesc, Bertranpetit Jaume
Format: Article
Language:English
Published: BMC 2009-01-01
Series:BMC Bioinformatics
id doaj-9e9b127cb0144585ac801e6313dcade9
record_format Article
spelling doaj-9e9b127cb0144585ac801e6313dcade92020-11-24T23:13:03ZengBMCBMC Bioinformatics1471-21052009-01-0110Suppl 1S7210.1186/1471-2105-10-S1-S72Minimizing recombinations in consensus networks for phylogeographic studiesMelé MartaJaved AsifParida LaxmiCalafell FrancescBertranpetit Jaume<p>Abstract</p> <p>Background</p> <p>We address the problem of studying recombinational variations in (human) populations. In this paper, our focus is on one computational aspect of the general task: Given two networks <it>G</it><sub>1 </sub>and <it>G</it><sub>2</sub>, with both mutation and recombination events, defined on overlapping sets of extant units the objective is to compute a consensus network <it>G</it><sub>3 </sub>with minimum number of additional recombinations. We describe a polynomial time algorithm with a guarantee that the number of computed new recombination events is within <it>ϵ </it>= <it>sz</it>(<it>G</it><sub>1</sub>, <it>G</it><sub>2</sub>) (function <it>sz </it>is a well-behaved function of the sizes and topologies of <it>G</it><sub>1 </sub>and <it>G</it><sub>2</sub>) of the optimal <it>number </it>of recombinations. To date, this is the best known result for a network consensus problem.</p> <p>Results</p> <p>Although the network consensus problem can be applied to a variety of domains, here we focus on structure of human populations. With our preliminary analysis on a segment of the human Chromosome X data we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. These results have been verified independently using traditional manual procedures. To the best of our knowledge, this is the first recombinations-based characterization of human populations.</p> <p>Conclusion</p> <p>We show that our mathematical model identifies recombination spots in the individual haplotypes; the aggregate of these spots over a set of haplotypes defines a recombinational landscape that has enough signal to detect continental as well as population divide based on a short segment of Chromosome X. In particular, we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. The agreement with mutation-based analysis can be viewed as an indirect validation of our results and the model. Since the model in principle gives us more information embedded in the networks, in our future work, we plan to investigate more non-traditional questions via these structures computed by our methodology.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Melé Marta
Javed Asif
Parida Laxmi
Calafell Francesc
Bertranpetit Jaume
spellingShingle Melé Marta
Javed Asif
Parida Laxmi
Calafell Francesc
Bertranpetit Jaume
Minimizing recombinations in consensus networks for phylogeographic studies
BMC Bioinformatics
author_facet Melé Marta
Javed Asif
Parida Laxmi
Calafell Francesc
Bertranpetit Jaume
author_sort Melé Marta
title Minimizing recombinations in consensus networks for phylogeographic studies
title_short Minimizing recombinations in consensus networks for phylogeographic studies
title_full Minimizing recombinations in consensus networks for phylogeographic studies
title_fullStr Minimizing recombinations in consensus networks for phylogeographic studies
title_full_unstemmed Minimizing recombinations in consensus networks for phylogeographic studies
title_sort minimizing recombinations in consensus networks for phylogeographic studies
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2009-01-01
description <p>Abstract</p> <p>Background</p> <p>We address the problem of studying recombinational variations in (human) populations. In this paper, our focus is on one computational aspect of the general task: Given two networks <it>G</it><sub>1 </sub>and <it>G</it><sub>2</sub>, with both mutation and recombination events, defined on overlapping sets of extant units the objective is to compute a consensus network <it>G</it><sub>3 </sub>with minimum number of additional recombinations. We describe a polynomial time algorithm with a guarantee that the number of computed new recombination events is within <it>ϵ </it>= <it>sz</it>(<it>G</it><sub>1</sub>, <it>G</it><sub>2</sub>) (function <it>sz </it>is a well-behaved function of the sizes and topologies of <it>G</it><sub>1 </sub>and <it>G</it><sub>2</sub>) of the optimal <it>number </it>of recombinations. To date, this is the best known result for a network consensus problem.</p> <p>Results</p> <p>Although the network consensus problem can be applied to a variety of domains, here we focus on structure of human populations. With our preliminary analysis on a segment of the human Chromosome X data we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. These results have been verified independently using traditional manual procedures. To the best of our knowledge, this is the first recombinations-based characterization of human populations.</p> <p>Conclusion</p> <p>We show that our mathematical model identifies recombination spots in the individual haplotypes; the aggregate of these spots over a set of haplotypes defines a recombinational landscape that has enough signal to detect continental as well as population divide based on a short segment of Chromosome X. In particular, we are able to infer ancient recombinations, population-specific recombinations and more, which also support the widely accepted 'Out of Africa' model. The agreement with mutation-based analysis can be viewed as an indirect validation of our results and the model. Since the model in principle gives us more information embedded in the networks, in our future work, we plan to investigate more non-traditional questions via these structures computed by our methodology.</p>
work_keys_str_mv AT melemarta minimizingrecombinationsinconsensusnetworksforphylogeographicstudies
AT javedasif minimizingrecombinationsinconsensusnetworksforphylogeographicstudies
AT paridalaxmi minimizingrecombinationsinconsensusnetworksforphylogeographicstudies
AT calafellfrancesc minimizingrecombinationsinconsensusnetworksforphylogeographicstudies
AT bertranpetitjaume minimizingrecombinationsinconsensusnetworksforphylogeographicstudies
_version_ 1725599631843262464