tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.

Molecular phylogenetics and phylogenomics are subject to noise from horizontal gene transfer (HGT) and bias from convergence in macromolecular compositions. Extensive variation in size, structure and base composition of alphaproteobacterial genomes has complicated their phylogenomics, sparking contr...

Full description

Bibliographic Details
Main Authors: Katherine C H Amrine, Wesley D Swingley, David H Ardell
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-02-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3937112?pdf=render
id doaj-c164ee64e4f64689bfe4086374a06188
record_format Article
spelling doaj-c164ee64e4f64689bfe4086374a061882020-11-24T21:49:06ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582014-02-01102e100345410.1371/journal.pcbi.1003454tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.Katherine C H AmrineWesley D SwingleyDavid H ArdellMolecular phylogenetics and phylogenomics are subject to noise from horizontal gene transfer (HGT) and bias from convergence in macromolecular compositions. Extensive variation in size, structure and base composition of alphaproteobacterial genomes has complicated their phylogenomics, sparking controversy over the origins and closest relatives of the SAR11 strains. SAR11 are highly abundant, cosmopolitan aquatic Alphaproteobacteria with streamlined, A+T-biased genomes. A dominant view holds that SAR11 are monophyletic and related to both Rickettsiales and the ancestor of mitochondria. Other studies dispute this, finding evidence of a polyphyletic origin of SAR11 with most strains distantly related to Rickettsiales. Although careful evolutionary modeling can reduce bias and noise in phylogenomic inference, entirely different approaches may be useful to extract robust phylogenetic signals from genomes. Here we develop simple phyloclassifiers from bioinformatically derived tRNA Class-Informative Features (CIFs), features predicted to target tRNAs for specific interactions within the tRNA interaction network. Our tRNA CIF-based model robustly and accurately classifies alphaproteobacterial genomes into one of seven undisputed monophyletic orders or families, despite great variability in tRNA gene complement sizes and base compositions. Our model robustly rejects monophyly of SAR11, classifying all but one strain as Rhizobiales with strong statistical support. Yet remarkably, conventional phylogenetic analysis of tRNAs classifies all SAR11 strains identically as Rickettsiales. We attribute this discrepancy to convergence of SAR11 and Rickettsiales tRNA base compositions. Thus, tRNA CIFs appear more robust to compositional convergence than tRNA sequences generally. Our results suggest that tRNA-CIF-based phyloclassification is robust to HGT of components of the tRNA interaction network, such as aminoacyl-tRNA synthetases. We explain why tRNAs are especially advantageous for prediction of traits governing macromolecular interactions from genomic data, and why such traits may be advantageous in the search for robust signals to address difficult problems in classification and phylogeny.http://europepmc.org/articles/PMC3937112?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Katherine C H Amrine
Wesley D Swingley
David H Ardell
spellingShingle Katherine C H Amrine
Wesley D Swingley
David H Ardell
tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
PLoS Computational Biology
author_facet Katherine C H Amrine
Wesley D Swingley
David H Ardell
author_sort Katherine C H Amrine
title tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
title_short tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
title_full tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
title_fullStr tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
title_full_unstemmed tRNA signatures reveal a polyphyletic origin of SAR11 strains among alphaproteobacteria.
title_sort trna signatures reveal a polyphyletic origin of sar11 strains among alphaproteobacteria.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2014-02-01
description Molecular phylogenetics and phylogenomics are subject to noise from horizontal gene transfer (HGT) and bias from convergence in macromolecular compositions. Extensive variation in size, structure and base composition of alphaproteobacterial genomes has complicated their phylogenomics, sparking controversy over the origins and closest relatives of the SAR11 strains. SAR11 are highly abundant, cosmopolitan aquatic Alphaproteobacteria with streamlined, A+T-biased genomes. A dominant view holds that SAR11 are monophyletic and related to both Rickettsiales and the ancestor of mitochondria. Other studies dispute this, finding evidence of a polyphyletic origin of SAR11 with most strains distantly related to Rickettsiales. Although careful evolutionary modeling can reduce bias and noise in phylogenomic inference, entirely different approaches may be useful to extract robust phylogenetic signals from genomes. Here we develop simple phyloclassifiers from bioinformatically derived tRNA Class-Informative Features (CIFs), features predicted to target tRNAs for specific interactions within the tRNA interaction network. Our tRNA CIF-based model robustly and accurately classifies alphaproteobacterial genomes into one of seven undisputed monophyletic orders or families, despite great variability in tRNA gene complement sizes and base compositions. Our model robustly rejects monophyly of SAR11, classifying all but one strain as Rhizobiales with strong statistical support. Yet remarkably, conventional phylogenetic analysis of tRNAs classifies all SAR11 strains identically as Rickettsiales. We attribute this discrepancy to convergence of SAR11 and Rickettsiales tRNA base compositions. Thus, tRNA CIFs appear more robust to compositional convergence than tRNA sequences generally. Our results suggest that tRNA-CIF-based phyloclassification is robust to HGT of components of the tRNA interaction network, such as aminoacyl-tRNA synthetases. We explain why tRNAs are especially advantageous for prediction of traits governing macromolecular interactions from genomic data, and why such traits may be advantageous in the search for robust signals to address difficult problems in classification and phylogeny.
url http://europepmc.org/articles/PMC3937112?pdf=render
work_keys_str_mv AT katherinechamrine trnasignaturesrevealapolyphyleticoriginofsar11strainsamongalphaproteobacteria
AT wesleydswingley trnasignaturesrevealapolyphyleticoriginofsar11strainsamongalphaproteobacteria
AT davidhardell trnasignaturesrevealapolyphyleticoriginofsar11strainsamongalphaproteobacteria
_version_ 1725889544270643200