The Automation of Glycopeptide Discovery in High Throughput MS/MS Data

Glycosylation, the addition of one or more carbohydrate molecules to a protein, is crucial for many cellular processes. Aberrant glycosylation is a key marker for various diseases such as cancer and rheumatoid arthritis. It has also recently been discovered that glycosylation is important in t...

Bibliographic Details
Main Author: Swamy, Sajani
Format: Others
Language: en
Published: University of Waterloo 2006
Subjects:
Online Access: http://hdl.handle.net/10012/1185
id ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-1185
record_format oai_dc
spelling ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-1185
2013-01-08T18:49:25Z
Swamy, Sajani
2006-08-22T14:27:32Z
2006-08-22T14:27:32Z
2004
2004
http://hdl.handle.net/10012/1185
application/pdf
1572916 bytes
application/pdf
en
University of Waterloo
Copyright: 2004, Swamy, Sajani. All rights reserved.
Computer Science
Glycomics
Proteomics
Automation
Carbohydrates Structure Determination
Glycoproteins
High Throughput
The Automation of Glycopeptide Discovery in High Throughput MS/MS Data
Thesis or Dissertation
School of Computer Science
Master of Mathematics
collection NDLTD
language en
format Others
sources NDLTD
topic Computer Science
Glycomics
Proteomics
Automation
Carbohydrates Structure Determination
Glycoproteins
High Throughput
description Glycosylation, the addition of one or more carbohydrate molecules to a protein, is crucial for many cellular processes. Aberrant glycosylation is a key marker for various diseases such as cancer and rheumatoid arthritis. It has also recently been discovered that glycosylation is important in the ability of the Human Immunodeficiency Virus (HIV) to evade recognition by the immune system. Given the importance of glycosylation in disease, major efforts are underway in life science research to investigate the glycome, the entire glycosylation profile of an organelle, cell or tissue type. To date, little bioinformatics research has been performed in glycomics due to the complexity of glycan structures and the low throughput of carbohydrate analysis. Recent advances in mass spectrometry (MS) have greatly facilitated the analysis of the glycome. Increasingly, this technology is preferred over traditional methods of carbohydrate analysis, which are often laborious and unsuitable for low-abundance glycoproteins. When subjected to mass spectrometry with collision-induced dissociation, glycopeptides produce characteristic MS/MS spectra that can be detected by visual inspection. However, given the high volume of data output from proteome studies today, manually searching for glycopeptides is an impractical task. In this thesis, we present a tool to automate the identification of glycopeptide spectra from MS/MS data. Further, we discuss some methodologies to automate the elucidation of the structure of the carbohydrate moiety of glycopeptides by adapting traditional MS/MS ion searching techniques employed in peptide sequence determination. MS/MS ion searching, a common technique in proteomics, aims to interpret MS/MS spectra by correlating structures from a database to the patterns represented in the spectrum. The tool was tested on high-throughput proteomics data and was shown to identify 97% of all glycopeptides present in the test data.
Further, the tool assigned correct carbohydrate structures to many of these glycopeptide MS/MS spectra. Applications of the tool in a proteomics environment for the analysis of glycopeptide expression in cancer tissue are also presented.
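The abstract does not say which spectral features the tool relies on. As a rough illustration only, one widely used way to flag candidate glycopeptide spectra is to look for low-mass carbohydrate oxonium ions in the MS/MS peak list. The sketch below assumes that approach; the function names, the tolerance, the intensity threshold, and the "at least two markers" rule are all hypothetical choices, not details taken from the thesis.

```python
# Illustrative sketch (not the thesis's method): flag spectra that contain
# common carbohydrate oxonium ions. m/z values are monoisotopic, singly charged.
MARKER_IONS = {
    "Hex": 163.0601,        # hexose
    "HexNAc": 204.0867,     # N-acetylhexosamine
    "NeuAc": 292.1027,      # N-acetylneuraminic acid
    "HexHexNAc": 366.1395,  # hexose + N-acetylhexosamine
}


def matched_markers(peaks, tol=0.02, min_rel_intensity=0.05):
    """Return the names of marker ions present in a peak list.

    peaks: list of (mz, intensity) tuples for one MS/MS spectrum.
    tol: absolute m/z tolerance in Da (assumed value).
    min_rel_intensity: minimum intensity relative to the base peak (assumed).
    """
    if not peaks:
        return []
    base = max(intensity for _, intensity in peaks)
    if base <= 0:
        return []
    found = []
    for name, marker_mz in MARKER_IONS.items():
        for mz, intensity in peaks:
            if abs(mz - marker_mz) <= tol and intensity / base >= min_rel_intensity:
                found.append(name)
                break
    return found


def looks_like_glycopeptide(peaks, min_markers=2, **kwargs):
    """Heuristic: call a spectrum a glycopeptide candidate if enough markers match."""
    return len(matched_markers(peaks, **kwargs)) >= min_markers


# Hypothetical peak lists for demonstration.
glyco_spectrum = [(204.087, 800.0), (366.140, 500.0), (512.3, 1000.0)]
plain_spectrum = [(175.119, 300.0), (512.3, 1000.0)]
print(looks_like_glycopeptide(glyco_spectrum))  # True
print(looks_like_glycopeptide(plain_spectrum))  # False
```

Requiring more than one marker is a simple guard against chance matches from ordinary peptide fragments; a real screen would likely tune the tolerance to the instrument's mass accuracy.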
author Swamy, Sajani
title The Automation of Glycopeptide Discovery in High Throughput MS/MS Data
publisher University of Waterloo
publishDate 2006
url http://hdl.handle.net/10012/1185