Global patterns of protein domain gain and loss in superkingdoms.

Domains are modules within proteins that can fold and function independently and are evolutionarily conserved. Here we compared the usage and distribution of protein domain families in the free-living proteomes of Archaea, Bacteria and Eukarya and reconstructed species phylogenies while tracing the...

Full description

Bibliographic Details
Main Authors: Arshan Nasir, Kyung Mo Kim, Gustavo Caetano-Anollés
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC3907288?pdf=render
id doaj-d472ac27459d43859666514c24c91e3b
record_format Article
spelling doaj-d472ac27459d43859666514c24c91e3b2020-11-25T01:11:55ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582014-01-01101e100345210.1371/journal.pcbi.1003452Global patterns of protein domain gain and loss in superkingdoms.Arshan NasirKyung Mo KimGustavo Caetano-AnollésDomains are modules within proteins that can fold and function independently and are evolutionarily conserved. Here we compared the usage and distribution of protein domain families in the free-living proteomes of Archaea, Bacteria and Eukarya and reconstructed species phylogenies while tracing the history of domain emergence and loss in proteomes. We show that both gains and losses of domains occurred frequently during proteome evolution. The rate of domain discovery increased approximately linearly in evolutionary time. Remarkably, gains generally outnumbered losses and the gain-to-loss ratios were much higher in akaryotes compared to eukaryotes. Functional annotations of domain families revealed that both Archaea and Bacteria gained and lost metabolic capabilities during the course of evolution while Eukarya acquired a number of diverse molecular functions including those involved in extracellular processes, immunological mechanisms, and cell regulation. Results also highlighted significant contemporary sharing of informational enzymes between Archaea and Eukarya and metabolic enzymes between Bacteria and Eukarya. Finally, the analysis provided useful insights into the evolution of species. The archaeal superkingdom appeared first in evolution by gradual loss of ancestral domains, bacterial lineages were the first to gain superkingdom-specific domains, and eukaryotes (likely) originated when an expanding proto-eukaryotic stem lineage gained organelles through endosymbiosis of already diversified bacterial lineages. The evolutionary dynamics of domain families in proteomes and the increasing number of domain gains is predicted to redefine the persistence strategies of organisms in superkingdoms, influence the make up of molecular functions, and enhance organismal complexity by the generation of new domain architectures. This dynamics highlights ongoing secondary evolutionary adaptations in akaryotic microbes, especially Archaea.http://europepmc.org/articles/PMC3907288?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Arshan Nasir
Kyung Mo Kim
Gustavo Caetano-Anollés
spellingShingle Arshan Nasir
Kyung Mo Kim
Gustavo Caetano-Anollés
Global patterns of protein domain gain and loss in superkingdoms.
PLoS Computational Biology
author_facet Arshan Nasir
Kyung Mo Kim
Gustavo Caetano-Anollés
author_sort Arshan Nasir
title Global patterns of protein domain gain and loss in superkingdoms.
title_short Global patterns of protein domain gain and loss in superkingdoms.
title_full Global patterns of protein domain gain and loss in superkingdoms.
title_fullStr Global patterns of protein domain gain and loss in superkingdoms.
title_full_unstemmed Global patterns of protein domain gain and loss in superkingdoms.
title_sort global patterns of protein domain gain and loss in superkingdoms.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2014-01-01
description Domains are modules within proteins that can fold and function independently and are evolutionarily conserved. Here we compared the usage and distribution of protein domain families in the free-living proteomes of Archaea, Bacteria and Eukarya and reconstructed species phylogenies while tracing the history of domain emergence and loss in proteomes. We show that both gains and losses of domains occurred frequently during proteome evolution. The rate of domain discovery increased approximately linearly in evolutionary time. Remarkably, gains generally outnumbered losses and the gain-to-loss ratios were much higher in akaryotes compared to eukaryotes. Functional annotations of domain families revealed that both Archaea and Bacteria gained and lost metabolic capabilities during the course of evolution while Eukarya acquired a number of diverse molecular functions including those involved in extracellular processes, immunological mechanisms, and cell regulation. Results also highlighted significant contemporary sharing of informational enzymes between Archaea and Eukarya and metabolic enzymes between Bacteria and Eukarya. Finally, the analysis provided useful insights into the evolution of species. The archaeal superkingdom appeared first in evolution by gradual loss of ancestral domains, bacterial lineages were the first to gain superkingdom-specific domains, and eukaryotes (likely) originated when an expanding proto-eukaryotic stem lineage gained organelles through endosymbiosis of already diversified bacterial lineages. The evolutionary dynamics of domain families in proteomes and the increasing number of domain gains is predicted to redefine the persistence strategies of organisms in superkingdoms, influence the make up of molecular functions, and enhance organismal complexity by the generation of new domain architectures. This dynamics highlights ongoing secondary evolutionary adaptations in akaryotic microbes, especially Archaea.
url http://europepmc.org/articles/PMC3907288?pdf=render
work_keys_str_mv AT arshannasir globalpatternsofproteindomaingainandlossinsuperkingdoms
AT kyungmokim globalpatternsofproteindomaingainandlossinsuperkingdoms
AT gustavocaetanoanolles globalpatternsofproteindomaingainandlossinsuperkingdoms
_version_ 1725168862363648000