Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.

Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequen...

Full description

Bibliographic Details
Main Authors: Jenna Morgan Lang, Aaron E Darling, Jonathan A Eisen
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI
id doaj-512400c595964b93a45603cd8e1dcb50
record_format Article
spelling doaj-512400c595964b93a45603cd8e1dcb502021-03-03T23:25:35ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0184e6251010.1371/journal.pone.0062510Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.Jenna Morgan LangAaron E DarlingJonathan A EisenOver 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a "primary concordance" tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Jenna Morgan Lang
Aaron E Darling
Jonathan A Eisen
spellingShingle Jenna Morgan Lang
Aaron E Darling
Jonathan A Eisen
Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
PLoS ONE
author_facet Jenna Morgan Lang
Aaron E Darling
Jonathan A Eisen
author_sort Jenna Morgan Lang
title Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
title_short Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
title_full Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
title_fullStr Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
title_full_unstemmed Phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
title_sort phylogeny of bacterial and archaeal genomes using conserved genes: supertrees and supermatrices.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a "primary concordance" tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23638103/?tool=EBI
work_keys_str_mv AT jennamorganlang phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices
AT aaronedarling phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices
AT jonathanaeisen phylogenyofbacterialandarchaealgenomesusingconservedgenessupertreesandsupermatrices
_version_ 1714811518599561216