Accuracy of phylogeny reconstruction methods combining overlapping gene data sets

<p>Abstract</p> <p>Background</p> <p>The availability of many gene alignments with overlapping taxon sets raises the question of which strategy is the best to infer species phylogenies from multiple gene information. Methods and programs abound that use the gene alignme...

Full description

Bibliographic Details
Main Authors: von Haeseler Arndt, Schmidt Heiko A, Kupczok Anne
Format: Article
Language:English
Published: BMC 2010-12-01
Series:Algorithms for Molecular Biology
Online Access:http://www.almob.org/content/5/1/37
id doaj-c2d520e9ea7c424782d89519606ea5db
record_format Article
spelling doaj-c2d520e9ea7c424782d89519606ea5db2020-11-24T21:02:02ZengBMCAlgorithms for Molecular Biology1748-71882010-12-01513710.1186/1748-7188-5-37Accuracy of phylogeny reconstruction methods combining overlapping gene data setsvon Haeseler ArndtSchmidt Heiko AKupczok Anne<p>Abstract</p> <p>Background</p> <p>The availability of many gene alignments with overlapping taxon sets raises the question of which strategy is the best to infer species phylogenies from multiple gene information. Methods and programs abound that use the gene alignment in different ways to reconstruct the species tree. In particular, different methods combine the original data at different points along the way from the underlying sequences to the final tree. Accordingly, they are classified into superalignment, supertree and medium-level approaches. Here, we present a simulation study to compare different methods from each of these three approaches.</p> <p>Results</p> <p>We observe that superalignment methods usually outperform the other approaches over a wide range of parameters including sparse data and gene-specific evolutionary parameters. In the presence of high incongruency among gene trees, however, other combination methods show better performance than the superalignment approach. Surprisingly, some supertree and medium-level methods exhibit, on average, worse results than a single gene phylogeny with complete taxon information.</p> <p>Conclusions</p> <p>For some methods, using the reconstructed gene tree as an estimation of the species tree is superior to the combination of incomplete information. Superalignment usually performs best since it is less susceptible to stochastic error. Supertree methods can outperform superalignment in the presence of gene-tree conflict.</p> http://www.almob.org/content/5/1/37
collection DOAJ
language English
format Article
sources DOAJ
author von Haeseler Arndt
Schmidt Heiko A
Kupczok Anne
spellingShingle von Haeseler Arndt
Schmidt Heiko A
Kupczok Anne
Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
Algorithms for Molecular Biology
author_facet von Haeseler Arndt
Schmidt Heiko A
Kupczok Anne
author_sort von Haeseler Arndt
title Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
title_short Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
title_full Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
title_fullStr Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
title_full_unstemmed Accuracy of phylogeny reconstruction methods combining overlapping gene data sets
title_sort accuracy of phylogeny reconstruction methods combining overlapping gene data sets
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2010-12-01
description <p>Abstract</p> <p>Background</p> <p>The availability of many gene alignments with overlapping taxon sets raises the question of which strategy is the best to infer species phylogenies from multiple gene information. Methods and programs abound that use the gene alignment in different ways to reconstruct the species tree. In particular, different methods combine the original data at different points along the way from the underlying sequences to the final tree. Accordingly, they are classified into superalignment, supertree and medium-level approaches. Here, we present a simulation study to compare different methods from each of these three approaches.</p> <p>Results</p> <p>We observe that superalignment methods usually outperform the other approaches over a wide range of parameters including sparse data and gene-specific evolutionary parameters. In the presence of high incongruency among gene trees, however, other combination methods show better performance than the superalignment approach. Surprisingly, some supertree and medium-level methods exhibit, on average, worse results than a single gene phylogeny with complete taxon information.</p> <p>Conclusions</p> <p>For some methods, using the reconstructed gene tree as an estimation of the species tree is superior to the combination of incomplete information. Superalignment usually performs best since it is less susceptible to stochastic error. Supertree methods can outperform superalignment in the presence of gene-tree conflict.</p>
url http://www.almob.org/content/5/1/37
work_keys_str_mv AT vonhaeselerarndt accuracyofphylogenyreconstructionmethodscombiningoverlappinggenedatasets
AT schmidtheikoa accuracyofphylogenyreconstructionmethodscombiningoverlappinggenedatasets
AT kupczokanne accuracyofphylogenyreconstructionmethodscombiningoverlappinggenedatasets
_version_ 1716776879803858944