Prediction of peptide retention time based on Gaussain Processes

Shotgun Proteomics is the leading technique for protein identification in complexmixtures. However, it produces a large amount of data which results in aextremely high computational cost for identifying the protein. Retention time(RT) is an important factor to be used to enhance the efficiency of pr...

Full description

Bibliographic Details
Main Author: Qiu, Xuanbin
Format: Others
Language:English
Published: KTH, Skolan för datavetenskap och kommunikation (CSC) 2015
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-175982
id ndltd-UPSALLA1-oai-DiVA.org-kth-175982
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-kth-1759822018-01-11T05:12:38ZPrediction of peptide retention time based on Gaussain ProcessesengQiu, XuanbinKTH, Skolan för datavetenskap och kommunikation (CSC)2015Peptideretention timegaussian processesComputer SciencesDatavetenskap (datalogi)Shotgun Proteomics is the leading technique for protein identification in complexmixtures. However, it produces a large amount of data which results in aextremely high computational cost for identifying the protein. Retention time(RT) is an important factor to be used to enhance the efficiency of protein identification.By predicting the retention time successfully, we could decrease thecomputational cost dramatically. This thesis uses a machine learning method,Gaussian Processes, to predict the retention time of a set of peptide in hand.We also implement a feature extraction method called Bag-of-Words to generatethe features for training the model. In addition, we also investigate theeffect of different types of optimization methods to the model’s parameters.The results show comparable precision of the prediction and relatively lowtime cost when comparing with the state-of-art prediction model. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-175982application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Peptide
retention time
gaussian processes
Computer Sciences
Datavetenskap (datalogi)
spellingShingle Peptide
retention time
gaussian processes
Computer Sciences
Datavetenskap (datalogi)
Qiu, Xuanbin
Prediction of peptide retention time based on Gaussain Processes
description Shotgun Proteomics is the leading technique for protein identification in complexmixtures. However, it produces a large amount of data which results in aextremely high computational cost for identifying the protein. Retention time(RT) is an important factor to be used to enhance the efficiency of protein identification.By predicting the retention time successfully, we could decrease thecomputational cost dramatically. This thesis uses a machine learning method,Gaussian Processes, to predict the retention time of a set of peptide in hand.We also implement a feature extraction method called Bag-of-Words to generatethe features for training the model. In addition, we also investigate theeffect of different types of optimization methods to the model’s parameters.The results show comparable precision of the prediction and relatively lowtime cost when comparing with the state-of-art prediction model.
author Qiu, Xuanbin
author_facet Qiu, Xuanbin
author_sort Qiu, Xuanbin
title Prediction of peptide retention time based on Gaussain Processes
title_short Prediction of peptide retention time based on Gaussain Processes
title_full Prediction of peptide retention time based on Gaussain Processes
title_fullStr Prediction of peptide retention time based on Gaussain Processes
title_full_unstemmed Prediction of peptide retention time based on Gaussain Processes
title_sort prediction of peptide retention time based on gaussain processes
publisher KTH, Skolan för datavetenskap och kommunikation (CSC)
publishDate 2015
url http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-175982
work_keys_str_mv AT qiuxuanbin predictionofpeptideretentiontimebasedongaussainprocesses
_version_ 1718604767274467328