Protein fold evolution on completed genomes : distinguishing between young and old folds

We review fold usage on completed genomes in order to explore protein structure evolution and assess the evolutionary relevance of current structural classification systems (SCOP and CATH). We assign folds on a set of 150 completed genomes using fold recognition methods (PSI-BLAST, SUPERFAMILY and G...

Full description

Bibliographic Details
Main Author: Abeln, Sanne
Other Authors: Deane, Charlotte M.
Published: University of Oxford 2007
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.441063
id ndltd-bl.uk-oai-ethos.bl.uk-441063
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-4410632015-03-19T05:17:23ZProtein fold evolution on completed genomes : distinguishing between young and old foldsAbeln, SanneDeane, Charlotte M.2007We review fold usage on completed genomes in order to explore protein structure evolution and assess the evolutionary relevance of current structural classification systems (SCOP and CATH). We assign folds on a set of 150 completed genomes using fold recognition methods (PSI-BLAST, SUPERFAMILY and Gene3D). The patterns of presence or absence of folds on genomes gives us insights into the relationships between folds and how we have arrived at the set of folds we see today. In particular, we develop a technique to estimate the relative ages of a protein fold based on genomic occurrence patterns in a phylogeny. We find that SCOP's `alpha/beta' class has relatively fewer distinct folds on large genomes, and that folds of this class tend to be older; folds of SCOP's `small protein' class follow opposite trends. Usage patterns show that folds with many copies on a genome are generally old, but that old folds do not necessarily have many copies. In addition, longer domains tend to be older and hydrophobic amino acids have high propensities for older folds whereas, polar - but non-charged - amino acids are associated with younger folds. Generally domains with stabilising features tend to be older. We also show that the reliability of fold recognition methods may be assessed using occurrence patterns. We develop a method, that detects false positives by identifying isolated occurrences in a phylogeny of species, and is able to improve genome wide fold recognition assignment sets. We use a structural fragment library to investigate evolutionary links between protein folds. We show that 'older' folds have relatively more such links than 'younger' folds. This correlation becomes stronger for longer fragment lengths suggesting that such links may reflect evolutionary relatedness.572.633Bioinformatics (life sciences) : Protein folding : Structural genomics : Computational biochemistry : Mathematical biology : Mathematical genetics and bioinformatics (statistics) : protein structure : protein folding : protein evolutionUniversity of Oxfordhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.441063http://ora.ox.ac.uk/objects/uuid:b520fd65-e829-4ae0-bed6-47d642909889Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 572.633
Bioinformatics (life sciences) : Protein folding : Structural genomics : Computational biochemistry : Mathematical biology : Mathematical genetics and bioinformatics (statistics) : protein structure : protein folding : protein evolution
spellingShingle 572.633
Bioinformatics (life sciences) : Protein folding : Structural genomics : Computational biochemistry : Mathematical biology : Mathematical genetics and bioinformatics (statistics) : protein structure : protein folding : protein evolution
Abeln, Sanne
Protein fold evolution on completed genomes : distinguishing between young and old folds
description We review fold usage on completed genomes in order to explore protein structure evolution and assess the evolutionary relevance of current structural classification systems (SCOP and CATH). We assign folds on a set of 150 completed genomes using fold recognition methods (PSI-BLAST, SUPERFAMILY and Gene3D). The patterns of presence or absence of folds on genomes gives us insights into the relationships between folds and how we have arrived at the set of folds we see today. In particular, we develop a technique to estimate the relative ages of a protein fold based on genomic occurrence patterns in a phylogeny. We find that SCOP's `alpha/beta' class has relatively fewer distinct folds on large genomes, and that folds of this class tend to be older; folds of SCOP's `small protein' class follow opposite trends. Usage patterns show that folds with many copies on a genome are generally old, but that old folds do not necessarily have many copies. In addition, longer domains tend to be older and hydrophobic amino acids have high propensities for older folds whereas, polar - but non-charged - amino acids are associated with younger folds. Generally domains with stabilising features tend to be older. We also show that the reliability of fold recognition methods may be assessed using occurrence patterns. We develop a method, that detects false positives by identifying isolated occurrences in a phylogeny of species, and is able to improve genome wide fold recognition assignment sets. We use a structural fragment library to investigate evolutionary links between protein folds. We show that 'older' folds have relatively more such links than 'younger' folds. This correlation becomes stronger for longer fragment lengths suggesting that such links may reflect evolutionary relatedness.
author2 Deane, Charlotte M.
author_facet Deane, Charlotte M.
Abeln, Sanne
author Abeln, Sanne
author_sort Abeln, Sanne
title Protein fold evolution on completed genomes : distinguishing between young and old folds
title_short Protein fold evolution on completed genomes : distinguishing between young and old folds
title_full Protein fold evolution on completed genomes : distinguishing between young and old folds
title_fullStr Protein fold evolution on completed genomes : distinguishing between young and old folds
title_full_unstemmed Protein fold evolution on completed genomes : distinguishing between young and old folds
title_sort protein fold evolution on completed genomes : distinguishing between young and old folds
publisher University of Oxford
publishDate 2007
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.441063
work_keys_str_mv AT abelnsanne proteinfoldevolutiononcompletedgenomesdistinguishingbetweenyoungandoldfolds
_version_ 1716740809373515776