A holistic approach to understanding CAZy families through reductionist methods

  In a time when the amount of biological data present in the public domain is becoming increasingly vast, the need for good classification systems has never been greater. In the field of glycoscience the necessity of a good classification for the enzymes involved in the biosynthesis, modification a...

Full description

Bibliographic Details
Main Author: Eklöf, Jens
Format: Others
Language:English
Published: KTH, Glykovetenskap 2009
Subjects:
XET
PME
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-10183
http://nbn-resolving.de/urn:isbn:978-91-7415-269-2
Description
Summary:  In a time when the amount of biological data present in the public domain is becoming increasingly vast, the need for good classification systems has never been greater. In the field of glycoscience the necessity of a good classification for the enzymes involved in the biosynthesis, modification and degradation of polysaccharides is even more pronounced than in other fields. This is due to the complexity of the substrates, the polysaccharides, as the theoretical number of possible hexa-oligosaccharides from only hexoses exceeds 1012 isomers!  An initiative to classify enzymes acting on carbohydrates began around 1990 by the French scientist Bernard Henrissat. The resulting database, the Carbohydrate Active enzymes database (CAZy), classifies enzymes by sequence similarity into families allowing the inference of structure and catalytic mechanism. What CAZy does not provide however, are means to understand how members of a family are related, and in what way they differ from each other. The top-down approach used in this thesis, combining phylogenetic analysis of whole CAZy families, or sub-families, with structural determinations and detailed kinetic analysis allows for exactly that.   Finding determinants for transglycosylation versus hydrolysis within the xth gene product family of GH16 as well as restricting the hydrolytic enzymes to a well defined clade are integral parts of paper I. In paper II a new bacterial sub-clade within CE8 was discovered. The structural determination of theEscherichia coli outer membrane lipoprotein YbhC from from the new sub-clade explained the difference in specificity. The information provided in the two papers of this thesis gives a better understanding of the development of different specificities of diverse CAZY families as well as it aids in future gene product annotations. Furthermore this work has begun to fill the white spots uncovered in the phylogenetic trees.