DAROGAN : enzyme function prediction from multiple sequence alignments

The function of an enzyme often depends on a few key functional residues. The principal objective of this project was to develop a novel function prediction system that takes advantage of this by comparing the conserved amino acids in known enzyme families with those in a putative enzyme. Mul...

Bibliographic Details
Main Author: Hamilton, Russell S.
Published: University of Edinburgh 2006
Subjects:
Online Access: http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.651998
id ndltd-bl.uk-oai-ethos.bl.uk-651998
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-651998 2016-06-21T03:21:06Z DAROGAN : enzyme function prediction from multiple sequence alignments Hamilton, Russell S. 2006 572.7 University of Edinburgh http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.651998 http://hdl.handle.net/1842/14972 Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 572.7
spellingShingle 572.7
Hamilton, Russell S.
DAROGAN : enzyme function prediction from multiple sequence alignments
description The function of an enzyme often depends on a few key functional residues. The principal objective of this project was to develop a novel function prediction system that takes advantage of this by comparing the conserved amino acids in known enzyme families with those in a putative enzyme. Multiple sequence alignments of well-characterised enzyme families (with an E.C. number assigned) are used to create unordered sets of conserved functional residues, termed Treads. A query protein's Tread is compared with the reference Treads by projecting them into multidimensional space and measuring the distance between them. A major advantage of the prediction strategy implemented in DAROGAN is that it should be able to recognise similarities in the functions of enzymes that are not similar in structure or sequence. The method has been tested on its ability to predict cofactor dependencies for enzymes utilising pyridoxal-5’-phosphate, thiamine, glutathione and folic acid. One area of application for DAROGAN is the prediction of previously described enzyme functions, in organisms with completed genomes, to which no gene or protein sequence could be assigned through the standard annotation processes. The potential of the DAROGAN method was investigated for proposing candidates for the pyridoxal-5’-phosphate utilising enzymes that are missing from the E. coli genome according to EcoCyc. Candidates are proposed by assessing the 511 sequences from the GeneQuiz project that have homologues in other species but uncertain functions. The assessment uses the DAROGAN method to determine the similarity of each sequence to the reference Treads.
author Hamilton, Russell S.
author_facet Hamilton, Russell S.
author_sort Hamilton, Russell S.
title DAROGAN : enzyme function prediction from multiple sequence alignments
title_short DAROGAN : enzyme function prediction from multiple sequence alignments
title_full DAROGAN : enzyme function prediction from multiple sequence alignments
title_fullStr DAROGAN : enzyme function prediction from multiple sequence alignments
title_full_unstemmed DAROGAN : enzyme function prediction from multiple sequence alignments
title_sort darogan : enzyme function prediction from multiple sequence alignments
publisher University of Edinburgh
publishDate 2006
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.651998
work_keys_str_mv AT hamiltonrussells daroganenzymefunctionpredictionfrommultiplesequencealignments
_version_ 1718312340597768192
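
The Tread comparison outlined in the description can be illustrated with a minimal Python sketch. It is not the DAROGAN implementation: the record does not say how conserved residues are selected or how Treads are projected, so the sketch assumes that a Tread is the collection of residues found at fully conserved alignment columns, that the projection is a 20-dimensional amino acid count vector, and that the distance is Euclidean. The function names, the conservation threshold and the toy family names are all hypothetical.

```python
from collections import Counter
import math

# The 20 standard amino acids; each one is an axis of the assumed projection.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def conserved_residues(alignment, threshold=1.0):
    """Collect the residue from every alignment column whose most common
    character is not a gap and occurs in at least `threshold` of the rows.
    Assumption: this stands in for the thesis's (unspecified) selection rule."""
    residues = []
    for column in zip(*alignment):
        residue, count = Counter(column).most_common(1)[0]
        if residue != "-" and count / len(column) >= threshold:
            residues.append(residue)
    return residues


def tread_vector(residues):
    """Project an unordered set of residues into 20-dimensional count space."""
    counts = Counter(residues)
    return [counts.get(aa, 0) for aa in AMINO_ACIDS]


def tread_distance(residues_a, residues_b):
    """Euclidean distance between two projected Treads (assumed metric)."""
    return math.dist(tread_vector(residues_a), tread_vector(residues_b))


def nearest_family(query_residues, reference_treads):
    """Return the name of the reference Tread closest to the query."""
    return min(
        reference_treads,
        key=lambda name: tread_distance(query_residues, reference_treads[name]),
    )


# Toy alignment: only columns 1, 3 and 5 are fully conserved.
toy_alignment = ["MKHLD", "MKHAD", "MRHLD"]
toy_tread = conserved_residues(toy_alignment)

# Hypothetical reference Treads keyed by made-up family names.
references = {"family_a": ["M", "H", "D"], "family_b": ["C", "C", "G"]}
best_match = nearest_family(toy_tread, references)
```

Because the count vector ignores residue order and position, two enzymes can score as close even when their sequences do not align well, which is the property the description highlights.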