Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.

BACKGROUND: In trying to understand the evolutionary relationships of organisms, the current flood of sequence data offers great opportunities, but also reveals new challenges with regard to data quality, the selection of data for subsequent analysis, and the automation of steps that were once done...

Full description

Bibliographic Details
Main Authors: Stefanie Hartmann, Conrad Helm, Birgit Nickel, Matthias Meyer, Torsten H Struck, Ralph Tiedemann, Joachim Selbig, Christoph Bleidorn
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3262807?pdf=render
id doaj-bb96b32e1277426c94fe11e9ad59eda9
record_format Article
spelling doaj-bb96b32e1277426c94fe11e9ad59eda92020-11-25T02:16:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0171e2984310.1371/journal.pone.0029843Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.Stefanie HartmannConrad HelmBirgit NickelMatthias MeyerTorsten H StruckRalph TiedemannJoachim SelbigChristoph BleidornBACKGROUND: In trying to understand the evolutionary relationships of organisms, the current flood of sequence data offers great opportunities, but also reveals new challenges with regard to data quality, the selection of data for subsequent analysis, and the automation of steps that were once done manually for single-gene analyses. Even though genome or transcriptome data is available for representatives of most bilaterian phyla, some enigmatic taxa still have an uncertain position in the animal tree of life. This is especially true for myzostomids, a group of symbiotic (or parasitic) protostomes that are either placed with annelids or flatworms. METHODOLOGY: Based on similarity criteria, Illumina-based transcriptome sequences of one myzostomid were compared to protein sequences of one additional myzostomid and 29 reference metazoa and clustered into gene families. These families were then used to investigate the phylogenetic position of Myzostomida using different approaches: Alignments of 989 sequence families were concatenated, and the resulting superalignment was analyzed under a Maximum Likelihood criterion. We also used all 1,878 gene trees with at least one myzostomid sequence for a supertree approach: the individual gene trees were computed and then reconciled into a species tree using gene tree parsimony. CONCLUSIONS: Superalignments require strictly orthologous genes, and both the gene selection and the widely varying amount of data available for different taxa in our dataset may cause anomalous placements and low bootstrap support. In contrast, gene tree parsimony is designed to accommodate multilocus gene families and therefore allows a much more comprehensive data set to be analyzed. Results of this supertree approach showed a well-resolved phylogeny, in which myzostomids were part of the annelid radiation, and major bilaterian taxa were found to be monophyletic.http://europepmc.org/articles/PMC3262807?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Stefanie Hartmann
Conrad Helm
Birgit Nickel
Matthias Meyer
Torsten H Struck
Ralph Tiedemann
Joachim Selbig
Christoph Bleidorn
spellingShingle Stefanie Hartmann
Conrad Helm
Birgit Nickel
Matthias Meyer
Torsten H Struck
Ralph Tiedemann
Joachim Selbig
Christoph Bleidorn
Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
PLoS ONE
author_facet Stefanie Hartmann
Conrad Helm
Birgit Nickel
Matthias Meyer
Torsten H Struck
Ralph Tiedemann
Joachim Selbig
Christoph Bleidorn
author_sort Stefanie Hartmann
title Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
title_short Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
title_full Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
title_fullStr Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
title_full_unstemmed Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
title_sort exploiting gene families for phylogenomic analysis of myzostomid transcriptome data.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description BACKGROUND: In trying to understand the evolutionary relationships of organisms, the current flood of sequence data offers great opportunities, but also reveals new challenges with regard to data quality, the selection of data for subsequent analysis, and the automation of steps that were once done manually for single-gene analyses. Even though genome or transcriptome data is available for representatives of most bilaterian phyla, some enigmatic taxa still have an uncertain position in the animal tree of life. This is especially true for myzostomids, a group of symbiotic (or parasitic) protostomes that are either placed with annelids or flatworms. METHODOLOGY: Based on similarity criteria, Illumina-based transcriptome sequences of one myzostomid were compared to protein sequences of one additional myzostomid and 29 reference metazoa and clustered into gene families. These families were then used to investigate the phylogenetic position of Myzostomida using different approaches: Alignments of 989 sequence families were concatenated, and the resulting superalignment was analyzed under a Maximum Likelihood criterion. We also used all 1,878 gene trees with at least one myzostomid sequence for a supertree approach: the individual gene trees were computed and then reconciled into a species tree using gene tree parsimony. CONCLUSIONS: Superalignments require strictly orthologous genes, and both the gene selection and the widely varying amount of data available for different taxa in our dataset may cause anomalous placements and low bootstrap support. In contrast, gene tree parsimony is designed to accommodate multilocus gene families and therefore allows a much more comprehensive data set to be analyzed. Results of this supertree approach showed a well-resolved phylogeny, in which myzostomids were part of the annelid radiation, and major bilaterian taxa were found to be monophyletic.
url http://europepmc.org/articles/PMC3262807?pdf=render
work_keys_str_mv AT stefaniehartmann exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT conradhelm exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT birgitnickel exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT matthiasmeyer exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT torstenhstruck exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT ralphtiedemann exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT joachimselbig exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
AT christophbleidorn exploitinggenefamiliesforphylogenomicanalysisofmyzostomidtranscriptomedata
_version_ 1724893416750317568