Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequen...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2013-01-01
|
Series: | PLoS ONE |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI |
id |
doaj-512400c595964b93a45603cd8e1dcb50 |
---|---|
record_format |
Article |
spelling |
doaj-512400c595964b93a45603cd8e1dcb502021-03-03T23:25:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0184e6251010.1371/journal.pone.0062510Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.Jenna Morgan LangAaron E DarlingJonathan A EisenOver 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a "primary concordance" tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jenna Morgan Lang Aaron E Darling Jonathan A Eisen |
spellingShingle |
Jenna Morgan Lang Aaron E Darling Jonathan A Eisen Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. PLoS ONE |
author_facet |
Jenna Morgan Lang Aaron E Darling Jonathan A Eisen |
author_sort |
Jenna Morgan Lang |
title |
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
title_short |
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
title_full |
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
title_fullStr |
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
title_full_unstemmed |
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
title_sort |
phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2013-01-01 |
description |
Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a "primary concordance" tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes. |
url |
https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI |
work_keys_str_mv |
AT jennamorganlang phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices AT aaronedarling phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices AT jonathanaeisen phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices |
_version_ |
1714811518599561216 |